TITLE OF THE INVENTION 
ALPHA-CONOTOXIN PEPTIDES 

CROSS-REFERENCE TO RELATED APPLICATION 

The present application is related to U.S. provisional patent application Serial No.
60/118,381, filed 29 January 1999, incorporated herein by reference.

This invention was made with Government support under Grant No. PO1 GM48677 awarded
by the National Institute of General Medical Sciences, National Institutes of Health, Bethesda, 
Maryland. The United States Government has certain rights in the invention. 

BACKGROUND OF THE INVENTION 
The invention relates to relatively short peptides (termed α-conotoxins herein), about 10-30
residues in length, which are naturally available in minute amounts in the venom of the cone snails
or analogous to the naturally available peptides, and which preferably include two disulfide bonds.

The publications and other materials used herein to illuminate the background of the
invention, and in particular, cases to provide additional details respecting the practice, are
incorporated by reference, and for convenience are referenced in the following text by author and
date and are listed alphabetically by author in the appended bibliography.

The predatory cone snails (Conus) have developed a unique biological strategy. Their
venom contains relatively small peptides that are targeted to various neuromuscular receptors and
may be equivalent in their pharmacological diversity to the alkaloids of plants or secondary
metabolites of microorganisms. Many of these peptides are among the smallest nucleic
acid-encoded translation products having defined conformations, and as such, they are somewhat
unusual. Peptides in this size range normally equilibrate among many conformations. Proteins
having a fixed conformation are generally much larger.

The cone snails that produce these peptides are a large genus of [illegible]
comprising approximately 500 species. All cone snail species are predators that inject venom to
capture prey, and the spectrum of animals that the genus as a whole can envenomate is broad. A
[illegible] however, every Conus species uses fundamentally the [text missing in source]

Several peptides isolated from Conus venoms have been characterized. These include the
α-, μ- and ω-conotoxins which target nicotinic acetylcholine receptors, muscle sodium channels,
and neuronal calcium channels, respectively (Olivera et al., 1985). Conopressins, which are
vasopressin analogs, have also been identified (Cruz et al., 1987). In addition, peptides named
conantokins have been isolated from Conus geographus and Conus tulipa (Mena et al., 1990; Haack
et al., 1990).

The α-conotoxins are small peptides highly specific for neuromuscular junction nicotinic
acetylcholine receptors (Gray et al., 1981; Marshall and Harvey, 1990; Blount et al., 1992; Jacobsen
et al., 1997) or highly specific for neuronal nicotinic acetylcholine receptors (Fainzilber et al., 1994;
Johnson et al., 1995; Cartier et al., 1996; Luo et al., 1998). The α-conotoxins with specificity for
neuromuscular junction nicotinic acetylcholine receptors are used as neuromuscular blocking agents
for use in conjunction with surgery, as disclosed in U.S. patent application Serial No. 09/ ,
filed 21 January 2000 (Attorney Docket No. 2314-178.A) and international patent application No.
PCT/US00/ , filed 21 January 2000 (Attorney Docket No. 2314-138.PCT), each incorporated
by reference herein. Additional α-conotoxins and uses for them have been described in U.S. Patent
Nos. 4,447,356 (Olivera et al., 1984); 5,432,155; 5,514,774, each incorporated herein by reference.

Additional uses for α-conotoxins are described in U.S. Serial No. 09/219,446, filed 22
December 1998, incorporated herein by reference. In this application, α-conotoxins with specificity
for neuronal nicotinic acetylcholine receptors are used for treating disorders regulated at neuronal
nicotinic acetylcholine receptors. Such disorders include, but are not limited to, cardiovascular
disorders, gastric motility disorders, urinary incontinence, nicotine addiction, mood disorders (such
as bipolar disorder, unipolar depression, dysthymia and seasonal affective disorder) and small cell
lung carcinoma, as well as the localization of small cell lung carcinoma.

It is desired to provide additional α-conotoxin peptides having uses as described herein.

SUMMARY OF THE INVENTION

The invention relates to relatively short peptides (termed α-conotoxins herein), about 10-30
residues in length, which are naturally available in minute amounts in the venom of the cone snails
or analogous to the naturally available peptides, and which preferably include two disulfide bonds.
More specifically, the present invention is directed to α-conotoxin peptides having the
general formula I:

Xaa1-Xaa2-Xaa3-Xaa4-Xaa5-Cys-Cys-Xaa6-Xaa7-Xaa8-Xaa9-Cys-Xaa10-Xaa11-Xaa12-Cys-Xaa13
(SEQ ID NO:1), wherein Xaa1 is des-Xaa1, Ile, Leu or Val; Xaa2 is des-Xaa2, Ala or Gly;
Xaa3 is [illegible], halo-Trp or any unnatural aromatic amino acid;
Xaa4 is des-Xaa4, Asp, Phe, Gly, Ala, Glu, γ-carboxy-Glu (Gla) or any unnatural aromatic amino acid;
Xaa5 is Glu, Gla, Asp, Ala, Thr, Ser, Gly, Ile, Tyr, nor-Tyr, mono-halo-Tyr, di-halo-Tyr, O-sulpho-Tyr,
O-phospho-Tyr, nitro-Tyr or any unnatural hydroxy containing amino acid;
Xaa6 is Thr, Arg, ornithine, homoarginine, Lys, N-methyl-Lys, N,N-dimethyl-Lys, N,N,N-trimethyl-Lys
or any unnatural basic amino acid;
Xaa7 is Asp, Glu, Gla, Arg, ornithine, homoarginine, Lys, N-methyl-Lys, N,N-dimethyl-Lys,
N,N,N-trimethyl-Lys or any unnatural basic amino acid;
Xaa8 is Ser, Thr, Asn, Ala, Gly, His, halo-His, Pro or hydroxy-Pro;
Xaa9 is Thr, Ser, Ala, Asp, Asn, Pro, hydroxy-Pro, Arg, ornithine, homoarginine, Lys, N-methyl-Lys,
N,N-dimethyl-Lys, N,N,N-trimethyl-Lys or any unnatural basic amino acid;
Xaa10 is Gly, Ser, Thr, Ala, Asn, Arg, ornithine, homoarginine, Lys, N-methyl-Lys, N,N-dimethyl-Lys,
N,N,N-trimethyl-Lys or any unnatural basic amino acid;
Xaa11 is Gln, Leu, His, halo-His, Trp (D or L), halo-Trp, neo-Trp, Tyr, nor-Tyr, mono-halo-Tyr,
di-halo-Tyr, O-sulpho-Tyr, O-phospho-Tyr, nitro-Tyr, Arg, ornithine, homoarginine, Lys, N-methyl-Lys,
N,N-dimethyl-Lys, N,N,N-trimethyl-Lys, any unnatural basic amino acid or any unnatural
aromatic amino acid;
Xaa12 is Asn, His, halo-His, Ile, Leu, Val, Gln, Arg, ornithine, homoarginine, Lys, N-methyl-Lys,
N,N-dimethyl-Lys, N,N,N-trimethyl-Lys or any unnatural basic amino acid;
Xaa13 is des-Xaa13, Val, Ile, Leu, Arg, ornithine, homoarginine, Lys, N-methyl-Lys, N,N-dimethyl-Lys,
N,N,N-trimethyl-Lys or any unnatural basic amino acid. The C-terminus may contain a free
carboxyl group or an amide group. The halo is chlorine, bromine or iodine, preferably iodine for
Tyr and His and preferably bromine for Trp. The Cys residues may be in D or L configuration and
may optionally be substituted with homocysteine (D or L). The Tyr residues may be substituted
with the 3-hydroxyl or 2-hydroxyl isomers and corresponding O-sulpho- and O-phospho-derivatives.
The acidic amino acid residues may be substituted with any synthetic acidic bioisosteric
amino acid surrogate, e.g., tetrazolyl derivatives of Gly and Ala.

More specifically, the present invention is directed to α-conotoxin peptides having the
general formula II:

Xaa1-Xaa2-Xaa3-Xaa4-Cys-Cys-Xaa5-Xaa6-Xaa7-Xaa8-Cys-Xaa9-Xaa10-Xaa11-Xaa12-Xaa13-Xaa14-Cys-Xaa15-Xaa16-Xaa17
(SEQ ID NO:2), wherein Xaa1 is des-Xaa1, Asp, Glu or γ-carboxy-Glu (Gla);
Xaa2 is des-Xaa2, Gln, Ala, Asp, Glu or Gla;
Xaa3 is des-Xaa3, Gly, Ala, Asp, Glu, Gla, Pro or hydroxy-Pro;
Xaa4 is des-Xaa4, Gly, Glu, Gla, Gln, Asp, Asn, Pro or hydroxy-Pro;
Xaa5 is Ser, Thr, Gly, Glu, Gla, Asn, Trp (D or L), neo-Trp, halo-Trp, Arg, ornithine, homoarginine,
Lys, N-methyl-Lys, N,N-dimethyl-Lys, N,N,N-trimethyl-Lys, any unnatural basic amino acid, Tyr,
nor-Tyr, mono-halo-Tyr, di-halo-Tyr, O-sulpho-Tyr, O-phospho-Tyr, nitro-Tyr or any unnatural
hydroxy containing amino acid;
Xaa6 is Asp, Asn, His, halo-His, Thr, Ser, Tyr, nor-Tyr, mono-halo-Tyr, di-halo-Tyr, O-sulpho-Tyr,
O-phospho-Tyr, nitro-Tyr or any unnatural hydroxy containing amino acid;
Xaa7 is Pro or hydroxy-Pro;
Xaa8 is Ala, Ser, Thr, Asp, Val, Ile, Pro, hydroxy-Pro, Tyr, nor-Tyr, mono-halo-Tyr, di-halo-Tyr,
O-sulpho-Tyr, O-phospho-Tyr, nitro-Tyr or any unnatural hydroxy containing amino acid;
Xaa9 is Gly, Ile, Leu, Val, Ala, Thr, Ser, Pro, hydroxy-Pro, Phe, Trp (D or L), neo-Trp, halo-Trp,
Arg, ornithine, homoarginine, Lys, N-methyl-Lys, N,N-dimethyl-Lys, N,N,N-trimethyl-Lys, any
unnatural basic amino acid or any unnatural aromatic amino acid;
Xaa10 is Ala, Asn, Phe, Pro, hydroxy-Pro, Glu, Gla, Gln, His, halo-His, Val, Ser, Thr, Arg,
ornithine, homoarginine, Lys, N-methyl-Lys, N,N-dimethyl-Lys, N,N,N-trimethyl-Lys or any
unnatural basic amino acid;
Xaa11 is Thr, Ser, His, halo-His, Leu, Ile, Val, Asn, Met, Pro, hydroxy-Pro, Arg, ornithine,
homoarginine, Lys, N-methyl-Lys, N,N-dimethyl-Lys, N,N,N-trimethyl-Lys, any unnatural basic
amino acid, Tyr, nor-Tyr, mono-halo-Tyr, di-halo-Tyr, O-sulpho-Tyr, O-phospho-Tyr, nitro-Tyr or
any unnatural hydroxy containing amino acid;
Xaa12 is Asn, Pro, hydroxy-Pro, Gln, Ser, Thr, Arg, ornithine, homoarginine, Lys, N-methyl-Lys,
N,N-dimethyl-Lys, N,N,N-trimethyl-Lys, any unnatural basic amino acid, Tyr, nor-Tyr,
mono-halo-Tyr, di-halo-Tyr, O-sulpho-Tyr, O-phospho-Tyr, nitro-Tyr or any unnatural hydroxy
containing amino acid;
Xaa13 is des-Xaa13, Gly, Thr, Ser, Pro, hydroxy-Pro, Tyr, nor-Tyr, mono-halo-Tyr, di-halo-Tyr,
O-sulpho-Tyr, O-phospho-Tyr, nitro-Tyr or any unnatural hydroxy containing amino acid;
Xaa14 is des-Xaa14, Ile, Val, Asp, Leu, Phe, Arg, ornithine, homoarginine, Lys, N-methyl-Lys,
N,N-dimethyl-Lys, N,N,N-trimethyl-Lys, any unnatural basic amino acid, Tyr, nor-Tyr,
mono-halo-Tyr, di-halo-Tyr, O-sulpho-Tyr, O-phospho-Tyr, nitro-Tyr or any unnatural hydroxy
containing amino acid;
Xaa15 is des-Xaa15, Gly, Ala, Met, Ser, Thr, Trp (D or L), neo-Trp, halo-Trp, any unnatural
aromatic amino acid, Arg, ornithine, homoarginine, Lys, N-methyl-Lys, N,N-dimethyl-Lys,
N,N,N-trimethyl-Lys or any unnatural basic amino acid;
Xaa16 is des-Xaa16, Trp (D or L), neo-Trp, halo-Trp, any unnatural aromatic amino acid, Arg,
ornithine, homoarginine, Lys, N-methyl-Lys, N,N-dimethyl-Lys, N,N,N-trimethyl-Lys or any
unnatural basic amino acid; and
Xaa17 is des-Xaa17, Arg, ornithine, homoarginine, Lys, N-methyl-Lys, N,N-dimethyl-Lys,
N,N,N-trimethyl-Lys or any unnatural basic amino acid. The C-terminus may contain a free
carboxyl group or an amide group. The halo is preferably bromine, chlorine or iodine, more
preferably iodine for His or Tyr and bromine for Trp. The Cys residues may be in D or L
configuration and may optionally be substituted with homocysteine (D or L). The Tyr residues may
be substituted with the 3-hydroxyl or 2-hydroxyl isomers and corresponding O-sulpho- and
O-phospho-derivatives. The acidic amino acid residues may be substituted with any synthetic acidic
bioisosteric amino acid surrogate, e.g., tetrazolyl derivatives of Gly and Ala.
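
The residue counts implied by general formula II can be illustrated with a short script. The following is only a sketch and forms no part of the disclosure: the function names are hypothetical, the check looks at the spacing of the Cys residues alone (it does not test whether each residue is a permitted choice for its position), and it assumes that Cys has not been replaced by homocysteine. The example sequence is the one given herein as SEQ ID NO:21.

```python
def cys_spacing(seq):
    """Return (N-terminal length, residues between successive Cys, C-terminal length)."""
    residues = seq.split("-")
    cys = [i for i, r in enumerate(residues) if r == "Cys"]
    loops = [b - a - 1 for a, b in zip(cys, cys[1:])]
    n_term = cys[0] if cys else len(residues)
    c_term = len(residues) - 1 - cys[-1] if cys else 0
    return n_term, loops, c_term


def fits_formula_ii_spacing(seq):
    """Check only the residue counts implied by general formula II (hypothetical helper)."""
    n_term, loops, c_term = cys_spacing(seq)
    return (
        len(loops) == 3          # exactly four Cys residues
        and n_term <= 4          # Xaa1-Xaa4, each of which may be absent (des-)
        and loops[0] == 0        # adjacent Cys-Cys pair
        and loops[1] == 4        # Xaa5-Xaa8, none of which may be absent
        and 4 <= loops[2] <= 6   # Xaa9-Xaa14; only Xaa13 and Xaa14 may be absent
        and c_term <= 3          # Xaa15-Xaa17, each of which may be absent
    )


SEQ_ID_NO_21 = "Gly-Cys-Cys-Gly-Asn-Xaa5-Asp-Cys-Thr-Ser-His-Ser-Cys"
print(cys_spacing(SEQ_ID_NO_21))
print(fits_formula_ii_spacing(SEQ_ID_NO_21))
```

A sequence with only three residues between the third and fourth Cys, as in the peptides of general formula I, would be rejected by this spacing check.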

More specifically, the present invention is directed to α-conotoxin peptides having the
general formula III:

Xaa1-Xaa2-Xaa3-Xaa4-Xaa5-Cys-Cys-Xaa6-Xaa7-Xaa8-Xaa9-Cys-Xaa10-Xaa11-Xaa12-Xaa13-
Xaa14-Xaa15-Xaa16-Cys-Xaa17-Xaa18-Xaa19-Xaa20-Xaa21-Xaa22-Xaa23-Xaa24 (SEQ ID NO:3), wherein
Xaa1 is des-Xaa1, Ser or Thr;
Xaa2 is des-Xaa2, Asp, Glu, γ-carboxy-Glu (Gla), Asp, Ser or Thr;
Xaa3 is des-Xaa3, Ala, Gly, Asn, Ser, Thr, Pro, hydroxy-Pro, Arg, ornithine, homoarginine, Lys,
N-methyl-Lys, N,N-dimethyl-Lys, N,N,N-trimethyl-Lys or any unnatural basic amino acid;
Xaa4 is des-Xaa4, Ala, Val, Leu, Ile, Gly, Glu, Gla, Gln, Asp, Asn, Phe, Pro, hydroxy-Pro or any
unnatural aromatic amino acid;
Xaa5 is des-Xaa5, Thr, Ser, Asp, Glu, Gla, Gln, Gly, Val, Asp, Asn, Ala, Pro, hydroxy-Pro, Arg,
ornithine, homoarginine, Lys, N-methyl-Lys, N,N-dimethyl-Lys, N,N,N-trimethyl-Lys or any
unnatural basic amino acid;
Xaa6 is Thr, Ser, Asp, Asn, Met, Val, Ala, Gly, Leu, Ile, Phe, any unnatural aromatic amino acid,
Pro, hydroxy-Pro, Tyr, nor-Tyr, mono-halo-Tyr, di-halo-Tyr, O-sulpho-Tyr, O-phospho-Tyr, nitro-Tyr
or any unnatural hydroxy containing amino acid;
Xaa7 is Ile, Leu, Val, Ser, Thr, Gln, Asn, Asp, Arg, His, halo-His, Phe, any unnatural aromatic
amino acid, homoarginine, ornithine, Lys, N-methyl-Lys, N,N-dimethyl-Lys, N,N,N-trimethyl-Lys,
any unnatural basic amino acid, Tyr, nor-Tyr, mono-halo-Tyr, di-halo-Tyr, O-sulpho-Tyr,
O-phospho-Tyr, nitro-Tyr or any unnatural hydroxy containing amino acid;
Xaa8 is Pro, hydroxy-Pro, Ser, Thr, Ile, Asp, Leu, Val, Gly, Ala, Phe, any unnatural aromatic amino
acid, Arg, ornithine, homoarginine, Lys, N-methyl-Lys, N,N-dimethyl-Lys, N,N,N-trimethyl-Lys or
any unnatural basic amino acid;
Xaa9 is Val, Ala, Gly, Ile, Leu, Asp, Ser, Thr, Pro, hydroxy-Pro, Arg, ornithine, homoarginine,
Lys, N-methyl-Lys, N,N-dimethyl-Lys, N,N,N-trimethyl-Lys or any unnatural basic amino acid;
Xaa10 is His, halo-His, Arg, homoarginine, ornithine, Lys, N-methyl-Lys, N,N-dimethyl-Lys,
N,N,N-trimethyl-Lys, any unnatural basic amino acid, Asn, Ala, Ser, Thr, Phe, Ile, Leu, Gly,
Trp (D or L), neo-Trp, halo-Trp, any unnatural aromatic amino acid, Tyr, nor-Tyr, mono-halo-Tyr,
di-halo-Tyr, O-sulpho-Tyr, O-phospho-Tyr, nitro-Tyr or any unnatural hydroxy containing amino acid;
Xaa11 is Leu, Gln, Val, Ile, Gly, Met, Ala, Lys, N-methyl-Lys, N,N-dimethyl-Lys,
N,N,N-trimethyl-Lys, Ser, Thr, Arg, homoarginine, ornithine, any unnatural basic amino acid, Asn,
Glu, Gla, Gln, Phe, Trp (D or L), neo-Trp, halo-Trp or any unnatural aromatic amino acid;
Xaa12 is Glu, Gla, Gln, Asn, Asp, Pro, hydroxy-Pro, Ser, Gly, Thr, Lys, N-methyl-Lys,
N,N-dimethyl-Lys, [illegible] any unnatural aromatic amino acid, Leu, Met, Gly, Ala, Tyr, nor-Tyr,
mono-halo-Tyr, di-halo-Tyr, O-sulpho-Tyr, O-phospho-Tyr, nitro-Tyr or any unnatural hydroxy
containing amino acid;
Xaa13 is His, halo-His, Asn, Thr, Ser, Ile, Val, Leu, Phe, any unnatural aromatic amino acid, Arg,
homoarginine, ornithine, Lys, N-methyl-Lys, N,N-dimethyl-Lys, N,N,N-trimethyl-Lys, any
unnatural basic amino acid, Tyr, nor-Tyr, mono-halo-Tyr, di-halo-Tyr, O-sulpho-Tyr, O-phospho-Tyr,
nitro-Tyr or any unnatural hydroxy containing amino acid;
Xaa14 is Ser, Thr, Ala, Gln, Pro, hydroxy-Pro, Gly, Ile, Leu, Arg, ornithine, homoarginine, Lys,
N-methyl-Lys, N,N-dimethyl-Lys, N,N,N-trimethyl-Lys or any unnatural basic amino acid;
Xaa15 is Asn, Glu, Gla, Asp, Gly, His, halo-His, Ala, Leu, Gln, Arg, ornithine, homoarginine, Lys,
N-methyl-Lys, N,N-dimethyl-Lys, N,N,N-trimethyl-Lys, any unnatural basic amino acid, Tyr,
nor-Tyr, mono-halo-Tyr, di-halo-Tyr, O-sulpho-Tyr, O-phospho-Tyr, nitro-Tyr or any unnatural
hydroxy containing amino acid;
Xaa16 is Met, Ile, Thr, Ser, Val, Leu, Pro, hydroxy-Pro, Phe, any unnatural aromatic amino acid,
Tyr, nor-Tyr, mono-halo-Tyr, di-halo-Tyr, O-sulpho-Tyr, O-phospho-Tyr, nitro-Tyr, any unnatural
hydroxy containing amino acid, Glu, Gla, Ala, His, halo-His, Arg, ornithine, homoarginine, Lys,
N-methyl-Lys, N,N-dimethyl-Lys, N,N,N-trimethyl-Lys or any unnatural basic amino acid;
Xaa17 is des-Xaa17, Gly, Asp, Asn, Ala, Ile, Leu, Ser, Thr, His, halo-His, Arg, ornithine,
homoarginine, Lys, N-methyl-Lys, N,N-dimethyl-Lys, N,N,N-trimethyl-Lys or any unnatural basic
amino acid;
Xaa18 is des-Xaa18, Gly, Glu, Gla, Gln, Trp (D or L), neo-Trp, halo-Trp, any unnatural aromatic
amino acid, Arg, ornithine, homoarginine, Lys, N-methyl-Lys, N,N-dimethyl-Lys,
N,N,N-trimethyl-Lys or any unnatural basic amino acid;
Xaa19 is des-Xaa19, Ser, Thr, Val, Ile, Ala, Arg, ornithine, homoarginine, Lys, N-methyl-Lys,
N,N-dimethyl-Lys, N,N,N-trimethyl-Lys or any unnatural basic amino acid;
Xaa20 is des-Xaa20, Val, Asp, His, halo-His, Arg, ornithine, homoarginine, Lys, N-methyl-Lys,
N,N-dimethyl-Lys, N,N,N-trimethyl-Lys or any unnatural basic amino acid;
Xaa21 is des-Xaa21, Asn, Pro or hydroxy-Pro;
Xaa22 is des-Xaa22, Arg, ornithine, homoarginine, Lys, N-methyl-Lys, N,N-dimethyl-Lys,
N,N,N-trimethyl-Lys or any unnatural basic amino acid;
Xaa23 is des-Xaa23, Ser or Thr;
Xaa24 is des-Xaa24, Leu, Ile or Val;
with the proviso that (a) Xaa5 is not Gly, when Xaa1 is des-Xaa1, Xaa2 is des-Xaa2, Xaa3 is
des-Xaa3, Xaa4 is des-Xaa4, Xaa6 is Ser, Xaa7 is His, Xaa8 is Pro, Xaa9 is Ala, Xaa10 is Ser,
Xaa11 is Val, Xaa12 is Asn, Xaa13 is Asn, Xaa14 is Pro, Xaa15 is Asp, Xaa16 is Ile, Xaa17 is
des-Xaa17, Xaa18 is des-Xaa18, Xaa19 is des-Xaa19, Xaa20 is des-Xaa20, Xaa21 is des-Xaa21,
Xaa22 is des-Xaa22, Xaa23 is des-Xaa23, and Xaa24 is des-Xaa24. The C-terminus may contain a
free carboxyl group or an amide group. The halo is preferably bromine, chlorine or iodine, more
preferably iodine for His and Tyr and bromine for Trp. The Cys residues may be in D or L
configuration and may optionally be substituted with homocysteine (D or L). The Tyr residues may
be substituted with the 3-hydroxyl or 2-hydroxyl isomers and corresponding O-sulpho- and
O-phospho-derivatives. The acidic amino acid residues may be substituted with any synthetic acidic
bioisosteric amino acid surrogate, e.g., tetrazolyl derivatives of Gly and Ala.

The present invention is also directed to novel specific α-conotoxin peptides of general
formula I having the formulas:

Asp-Xaa1-Cys-Cys-Ser-Asp-Ser-Arg-Cys-Gly-Xaa2-Asn-Cys-Leu (SEQ ID NO:4);
Ala-Cys-Cys-Ser-Asp-Arg-Arg-Cys-Arg-Xaa3-Arg-Cys (SEQ ID NO:5);
Phe-Thr-Cys-Cys-Arg-Arg-Gly-Thr-Cys-Ser-Gln-His-Cys (SEQ ID NO:6);
Asp-Xaa4-Cys-Cys-Arg-Arg-His-Ala-Cys-Thr-Leu-Ile-Cys (SEQ ID NO:7);
Asp-Xaa4-Cys-Cys-Arg-Xaa5-Xaa5-Cys-Thr-Leu-Ile-Cys (SEQ ID NO:8);
Gly-Cys-Cys-Ser-Asp-Xaa5-Arg-Cys-Arg-Xaa4-Arg-Cys-Arg (SEQ ID NO:9);
Gly-Gly-Cys-Cys-Ser-Asp-Xaa5-Arg-Cys-Ala-Xaa3-Arg-Cys (SEQ ID NO:10);
Ile-Ala-Xaa3-Asp-Ile-Cys-Cys-Ser-Xaa1-Xaa5-Asp-Cys-Asn-His-Xaa2-Cys-Val (SEQ ID NO:11); and

Cys-Cys-Ser-Asp-Xaa5-Arg-Cys-Xaa2-His-Gln-Cys (SEQ ID NO:12),




wherein Xaa1 is Glu or γ-carboxy-Glu (Gla); Xaa2 is Lys, N-methyl-Lys, N,N-dimethyl-Lys or
N,N,N-trimethyl-Lys; Xaa3 is Trp (D or L), halo-Trp or neo-Trp; Xaa4 is Tyr, nor-Tyr, mono-halo-Tyr,
di-halo-Tyr, O-sulpho-Tyr, O-phospho-Tyr or nitro-Tyr; and Xaa5 is Pro or hydroxy-Pro; and
the C-terminus contains a carboxyl or amide group. The halo is preferably bromine, chlorine or
iodine, more preferably iodine for Tyr and bromine for Trp. In addition, the His residues may be
substituted with halo-His; the Arg residues may be substituted by Lys, ornithine, homoarginine,
N-methyl-Lys, N,N-dimethyl-Lys, N,N,N-trimethyl-Lys or any unnatural basic amino acid; the Lys
residues may be substituted by Arg, ornithine, homoarginine, N-methyl-Lys, N,N-dimethyl-Lys,
N,N,N-trimethyl-Lys or any unnatural basic amino acid; the Tyr residues may be substituted with
any unnatural hydroxy containing amino acid; the Ser residues may be substituted with Thr; the Thr
residues may be substituted with Ser; and the Phe and Trp residues may be substituted with any
unnatural aromatic amino acid. The Cys residues may be in D or L configuration and may
optionally be substituted with homocysteine (D or L). The Tyr residues may be substituted with the
3-hydroxyl or 2-hydroxyl isomers and corresponding O-sulpho- and O-phospho-derivatives. The
acidic amino acid residues may be substituted with any synthetic acidic bioisosteric amino acid
surrogate, e.g., tetrazolyl derivatives of Gly and Ala.

More specifically, the present invention is directed to the following α-conotoxin peptides
of general formula I:

Im1.1: SEQ ID NO:4, wherein Xaa1 is Glu and Xaa2 is Lys;
Im1.2: SEQ ID NO:5, wherein Xaa3 is Trp;
Rg1.2: SEQ ID NO:6;
Rg1.6: SEQ ID NO:7, wherein Xaa4 is Tyr;
Rg1.6A: SEQ ID NO:8, wherein Xaa4 is Tyr and Xaa5 is Pro;
Rg1.7: SEQ ID NO:9, wherein Xaa4 is Tyr and Xaa5 is Pro;
Rg1.9: SEQ ID NO:10, wherein Xaa3 is Trp and Xaa5 is Pro;
Rg1.10: SEQ ID NO:11, wherein Xaa1 is Glu, Xaa2 is Lys, Xaa3 is Trp and Xaa5 is Pro; and
Rg1.11: SEQ ID NO:12, wherein Xaa2 is Lys and Xaa5 is Pro.



The C-terminus of Im1.1, Rg1.7 and Rg1.10 preferably contains a free carboxyl group. The
C-terminus of Im1.2, Rg1.2, Rg1.6, Rg1.6A, Rg1.9 and Rg1.11 preferably contains an amide group.
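
By way of illustration only, the relationship between a generic formula containing Xaa placeholders and a specific peptide can be sketched in a few lines of code. The helper names below are hypothetical and the table covers only the twenty standard amino acids, so modified residues such as Gla or hydroxy-Pro would need additional handling. The example resolves the formula given herein as SEQ ID NO:7 with Xaa4 assigned to Tyr, as stated above for Rg1.6.

```python
# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def resolve(template, assignments):
    """Replace each Xaa placeholder with its assigned residue (hypothetical helper)."""
    resolved = []
    for residue in template.split("-"):
        if residue.startswith("Xaa"):
            residue = assignments[residue]  # raises KeyError if a placeholder is unassigned
        resolved.append(residue)
    return "-".join(resolved)


def to_one_letter(seq):
    """Convert a hyphen-separated three-letter sequence to one-letter code."""
    return "".join(THREE_TO_ONE[r] for r in seq.split("-"))


SEQ_ID_NO_7 = "Asp-Xaa4-Cys-Cys-Arg-Arg-His-Ala-Cys-Thr-Leu-Ile-Cys"
rg1_6 = resolve(SEQ_ID_NO_7, {"Xaa4": "Tyr"})
print(rg1_6)                 # Asp-Tyr-Cys-Cys-Arg-Arg-His-Ala-Cys-Thr-Leu-Ile-Cys
print(to_one_letter(rg1_6))  # DYCCRRHACTLIC
```

The sketch does not represent the C-terminal amide or any disulfide connectivity; it shows only how the placeholder assignments select one peptide from the generic formula.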
The present invention is further directed to novel specific α-conotoxin peptides of general
formula II having the formulas:

Cys-Cys-Ser-Asp-Xaa5-Ala-Cys-Xaa2-Gln-Thr-Xaa5-Gly-Cys-Arg (SEQ ID NO:13);
Cys-Cys-Xaa1-Asn-Xaa5-Ala-Cys-Arg-His-Thr-Gln-Gly-Cys (SEQ ID NO:14);
Gly-Cys-Cys-Xaa3-His-Xaa5-Ala-Cys-Gly-Arg-His-Xaa4-Cys (SEQ ID NO:15);
Ala-Xaa5-Cys-Cys-Asn-Asn-Xaa5-Ala-Cys-Val-Xaa2-His-Arg-Cys (SEQ ID NO:16);
Ala-Xaa5-Gly-Cys-Cys-Asn-Asn-Xaa5-Ala-Cys-Val-Xaa2-His-Arg-Cys (SEQ ID NO:17);
Xaa5-Xaa5-Cys-Cys-Asn-Asn-Xaa5-Ala-Cys-Val-Xaa2-His-Arg-Cys (SEQ ID NO:18);

Asp-Xaa1-Asn-Cys-Cys-Xaa3-Asn-Xaa5-Ser-Cys-Xaa[illegible]-Arg-Xaa5-Arg-Cys-Thr (SEQ ID NO:19);

Gly-Cys-Cys-Ser-Thr-Xaa5-Xaa5-Cys-Ala-Val-Leu-Xaa4-Cys (SEQ ID NO:20);
Gly-Cys-Cys-Gly-Asn-Xaa5-Asp-Cys-Thr-Ser-His-Ser-Cys (SEQ ID NO:21);
Gly-Cys-Cys-Ser-Asn-Xaa5-Xaa5-Cys-Ala-His-Asn-Asn-Xaa5-Asp-Cys-Arg (SEQ ID NO:42);
Gly-Cys-Cys-Xaa4-Asn-Xaa5-Val-Cys-Xaa2-Xaa2-Xaa4-Xaa4-Cys-Xaa3-Xaa2 (SEQ ID NO:154);

Xaa6-Xaa1-Xaa5-Gly-Cys-Cys-Arg-His-Xaa5-Ala-Cys-Gly-Xaa2-Asn-Arg-Cys (SEQ ID NO:155);



Cys-Cys-Ala-Asp-Xaa5-Asp-Cys-Arg-Phe-Arg-Xaa5-Gly-Cys (SEQ ID NO:156);
Gly-Cys-Cys-Xaa4-Asn-Xaa5-Ser-Cys-Xaa3-Xaa5-Xaa2-Thr-Xaa4-Cys-Ser-Xaa3-Xaa2 (SEQ ID NO:157);
Cys-Cys-Ser-Asn-Xaa5-Thr-Cys-Xaa2-Xaa1-Thr-Xaa4-Gly-Cys (SEQ ID NO:158);
Cys-Cys-Ala-Asn-Xaa5-Ile-Cys-Xaa2-Asn-Thr-Xaa5-Gly-Cys (SEQ ID NO:159);
Cys-Cys-Asn-Asn-Xaa5-Thr-Cys-Xaa2-Xaa1-Thr-Xaa4-Gly-Cys (SEQ ID NO:160);
Cys-Cys-Ser-Asn-Xaa5-Val-Cys-Xaa2-Xaa1-Thr-Xaa4-Gly-Cys (SEQ ID NO:161);
Gly-Gly-Cys-Cys-Ser-Xaa4-Xaa5-Xaa5-Cys-Ile-Ala-Ser-Asn-Xaa5-Xaa2-Cys-Gly (SEQ ID NO:162);
Gly-Cys-Cys-Ser-His-Xaa5-Val-Cys-Ser-Ala-Met-Ser-Xaa5-Ile-Cys (SEQ ID NO:163);
Gly-Cys-Cys-Xaa2-Asn-Xaa5-Xaa4-Cys-Gly-Ala-Ser-Xaa2-Thr-Xaa4-Cys (SEQ ID NO:164);
Gly-Cys-Cys-Ser-Xaa4-Xaa5-Xaa5-Cys-Phe-Ala-Thr-Asn-Xaa5-Asp-Cys (SEQ ID NO:165);
Gly-Gly-Cys-Cys-Ser-Xaa4-Xaa5-Xaa5-Cys-Ile-Ala-Asn-Asn-Xaa5-Leu-Cys-Ala (SEQ ID NO:166);
Gly-Gly-Cys-Cys-Ser-Xaa4-Xaa5-Xaa5-Cys-Ile-Ala-Asn-Asn-Xaa5-Phe-Cys-Ala (SEQ ID NO:167);
Asp-Cys-Cys-Ser-Asn-Xaa5-Xaa5-Cys-Ser-Gln-Asn-Asn-Xaa5-Asp-Cys-Met (SEQ ID NO:168); and

[illegible]-Cys-Cys-Ser-Asn-Xaa5-Xaa5-Cys-Ala-His-Asn-Asn-Xaa5-Asp-Cys-Arg (SEQ ID NO:169),
wherein Xaa1 is Glu or γ-carboxy-Glu (Gla); Xaa2 is Lys, N-methyl-Lys, N,N-dimethyl-Lys or
N,N,N-trimethyl-Lys; Xaa3 is Trp (D or L), halo-Trp or neo-Trp; Xaa4 is Tyr, nor-Tyr, mono-halo-Tyr,
di-halo-Tyr, O-sulpho-Tyr, O-phospho-Tyr or nitro-Tyr; and Xaa5 is Pro or hydroxy-Pro; and
the C-terminus contains a carboxyl or amide group. The halo is preferably bromine, chlorine or
iodine, more preferably iodine for Tyr and bromine for Trp. In addition, the His residues may be
substituted with halo-His; the Arg residues may be substituted by Lys, ornithine, homoarginine,
N-methyl-Lys, N,N-dimethyl-Lys, N,N,N-trimethyl-Lys or any unnatural basic amino acid; the Lys
residues may be substituted by Arg, ornithine, homoarginine, N-methyl-Lys, N,N-dimethyl-Lys,
N,N,N-trimethyl-Lys or any unnatural basic amino acid; the Tyr residues may be substituted with
any unnatural hydroxy containing amino acid; the Ser residues may be substituted with Thr; the Thr
residues may be substituted with Ser; and the Phe and Trp residues may be substituted with any
unnatural aromatic amino acid. The Cys residues may be in D or L configuration and may
optionally be substituted with homocysteine (D or L). The Tyr residues may be substituted with the
3-hydroxyl or 2-hydroxyl isomers and corresponding O-sulpho- and O-phospho-derivatives. The
acidic amino acid residues may be substituted with any synthetic acidic bioisosteric amino acid
surrogate, e.g., tetrazolyl derivatives of Gly and Ala.
More specifically, the present invention is directed to the following α-conotoxin peptides
of general formula II:



Sn1.1: SEQ ID NO:13, wherein Xaa2 is Lys and Xaa5 is Pro;
Sn1.2: SEQ ID NO:14, wherein Xaa1 is Glu and Xaa5 is Pro;
Sl1.3: SEQ ID NO:15, wherein Xaa3 is Trp, Xaa4 is Tyr and Xaa5 is Pro;
A1.2: SEQ ID NO:16, wherein Xaa2 is Lys and Xaa5 is Pro;
Bu1.1: SEQ ID NO:17, wherein Xaa2 is Lys and Xaa5 is Pro;
Bu1.2: SEQ ID NO:18, wherein Xaa2 is Lys and Xaa5 is Pro;
Bu1.3: SEQ ID NO:19, wherein Xaa1 is Glu, Xaa3 is Trp and Xaa5 is Pro;
Bu1.4: SEQ ID NO:20, wherein Xaa4 is Tyr and Xaa5 is Pro;
Cr1.3: SEQ ID NO:21, wherein Xaa5 is Pro;
Di1.1: SEQ ID NO:42, wherein Xaa5 is Pro;
Ms1.7: SEQ ID NO:154, wherein Xaa2 is Lys, Xaa3 is Trp, Xaa4 is Tyr and Xaa5 is Pro;
P1.7: SEQ ID NO:155, wherein Xaa1 is Glu, Xaa2 is Lys, Xaa5 is Pro and Xaa6 is Gln;
Ms1.2: SEQ ID NO:156, wherein Xaa5 is Pro;
Ms1.3: SEQ ID NO:157, wherein Xaa2 is Lys, Xaa3 is Trp, Xaa4 is Tyr and Xaa5 is Pro;
Ms1.4: SEQ ID NO:158, wherein Xaa1 is Glu, Xaa2 is Lys, Xaa4 is Tyr and Xaa5 is Pro;
Ms1.5: SEQ ID NO:159, wherein Xaa2 is Lys and Xaa5 is Pro;
Ms1.8: SEQ ID NO:160, wherein Xaa1 is Glu, Xaa2 is Lys, Xaa4 is Tyr and Xaa5 is Pro;
Ms1.9: SEQ ID NO:161, wherein Xaa1 is Glu, Xaa2 is Lys, Xaa4 is Tyr and Xaa5 is Pro;
Bt1.7: SEQ ID NO:162, wherein Xaa2 is Lys, Xaa4 is Tyr and Xaa5 is Pro;
Lv1.5: SEQ ID NO:163, wherein Xaa5 is Pro;




Ms1.10: SEQ ID NO:164, wherein Xaa2 is Lys, Xaa4 is Tyr and Xaa5 is Pro;
Vr1.1: SEQ ID NO:165, wherein Xaa4 is Tyr and Xaa5 is Pro;
Om1.1: SEQ ID NO:166, wherein Xaa4 is Tyr and Xaa5 is Pro;
R1.6: SEQ ID NO:167, wherein Xaa4 is Tyr and Xaa5 is Pro;
R1.7: SEQ ID NO:168, wherein Xaa5 is Pro; and
Vr1.2: SEQ ID NO:169, wherein Xaa5 is Pro.



The C-terminus preferably contains a carboxyl group for the peptides Sn1.1, Sn1.2, Cr1.3, Di1.1,
Ms1.2, Ms1.4, Ms1.5, Ms1.8, Ms1.9, Vr1.1 and Vr1.2. The C-terminus of the other peptides
preferably contains an amide group.

The present invention is also directed to novel specific α-conotoxin peptides of general
formula III having the formulas:

Gly-Cys-Cys-Ser-Asn-Xaa5-Val-Cys-His-Leu-Xaa1-His-Ser-Asn-Met-Cys (SEQ ID NO:22);
Gly-Cys-Cys-Ser-Asn-Xaa5-Val-Cys-Arg-Gln-Asn-Asn-Ala-Xaa1-Xaa4-Cys-Arg (SEQ ID NO:23);
Xaa5-Gln-Cys-Cys-Ser-His-Xaa5-Ala-Cys-Asn-Val-Asp-His-Xaa5-Xaa1-Ile-Cys-Arg (SEQ ID NO:24);
Xaa5-Xaa1-Cys-Cys-Ser-His-Xaa5-Ala-Cys-Asn-Val-Asp-His-Xaa5-Xaa1-Ile-Cys-Arg (SEQ ID NO:25);
Xaa5-Gln-Cys-Cys-Ser-His-Xaa5-Ala-Cys-Asn-Val-Asp-His-Xaa5-Xaa1-Ile-Cys-Asp (SEQ ID NO:26);
Xaa5-Arg-Cys-Cys-Ser-His-Xaa5-Ala-Cys-Asn-Val-Asp-His-Xaa5-Xaa1-Ile-Cys-Arg (SEQ ID NO:27);
Xaa5-Gln-Cys-Cys-Ser-His-Xaa5-Ala-Cys-Asn-Val-Asp-His-Xaa5-Gly-Ile-Cys-Arg (SEQ ID NO:28);
Xaa5-Gln-Cys-Cys-Ser-His-Xaa5-Ala-Cys-Asn-Val-Asp-His-Xaa5-Xaa1-Thr-Cys-Arg (SEQ ID NO:29);
Xaa5-Gln-Cys-Cys-Ser-His-Xaa5-Ala-Cys-Asn-Val-Asp-His-Xaa5-Xaa1-Val-Cys-Arg (SEQ ID NO:30);
Xaa5-Gln-Cys-Cys-Ser-His-Xaa5-Ala-Cys-Asn-Ile-Asp-His-Xaa5-Xaa1-Ile-Cys-Arg (SEQ ID NO:31);
Xaa5-Gln-Cys-Cys-Ser-His-Xaa5-Ala-Cys-Asn-Val-Asp-His-Xaa5-Xaa1-Ile-Cys-Arg-Arg-Arg-Arg (SEQ ID NO:32);
Gly-Gly-Cys-Cys-Ser-His-Xaa5-Ala-Cys-Ala-Val-Asn-His-Xaa5-Xaa1-Leu-Cys (SEQ ID NO:33);
Gly-Cys-Cys-Ser-His-Xaa5-Ala-Cys-Ser-Val-Asn-His-Xaa5-Xaa1-Leu-Cys (SEQ ID NO:34);
Gly-Cys-Cys-Ser-His-Xaa5-Ala-Cys-Asn-Val-Asp-His-Xaa5-Xaa1-Ile-Cys (SEQ ID NO:35);
Gly-Cys-Cys-Ser-His-Xaa5-Ala-Cys-Ser-Gly-Xaa2-Thr-Gln-Xaa1-Xaa5-Cys-Arg-Xaa1-Ser (SEQ ID NO:36);
Xaa5-Cys-Cys-Ser-His-Xaa5-Ala-Cys-Ser-Gly-Asn-Asn-Xaa5-Xaa1-Phe-Cys-Arg-Gln (SEQ ID NO:37);
Gly-Cys-Cys-Ser-His-Xaa5-Ala-Cys-Ser-Gly-Asn-Asn-Xaa5-Xaa1-Phe-Cys-Arg-Gln (SEQ ID NO:38);

Gly-Cys-Cys-Ser-His-Xaa5-Xaa5-Cys-Ala-Met-Asn-Asn-Xaa5-Asp-Xaa4-Cys (SEQ ID NO:39);
Gly-Cys-Cys-Ser-His-Xaa5-Xaa5-Cys-Phe-Leu-Asn-Asn-Xaa5-Asp-Xaa4-Cys (SEQ ID NO:40);
Gly-Cys-Cys-Ser-Asn-Xaa5-Xaa5-Cys-Ile-Ala-Xaa2-Asn-Xaa5-His-Met-Cys-Gly (SEQ ID NO:41);
Gly-Cys-Cys-Ser-Asn-Xaa5-Ala-Cys-Ala-Gly-Asn-Asn-Xaa5-His-Val-Cys-Arg-Gln (SEQ ID NO:43);
Gly-Cys-Cys-Ser-Arg-Xaa5-Ala-Cys-Ile-Ala-Asn-Asn-Xaa5-Asp-Leu-Cys (SEQ ID NO:44);
Gly-Cys-Cys-Ser-Asn-Xaa5-Val-Cys-His-Val-Xaa1-His-Xaa5-Xaa1-Leu-Cys-Arg-Arg-Arg-Arg (SEQ ID NO:45);
Gly-Gly-Cys-Cys-Ser-Phe-Xaa5-Ala-Cys-Arg-Xaa2-Xaa5-Arg-Xaa5-Xaa1-Met-Cys-Gly (SEQ ID NO:46);
Xaa5-Xaa1-Cys-Cys-Ser-Asp-Xaa5-Arg-Cys-Asn-Ser-Ser-His-Xaa5-Xaa1-Leu-Cys-Gly (SEQ ID NO:47);
Xaa5-Gln-Cys-Cys-Ser-Asp-Xaa5-Arg-Cys-Asn-Val-Gly-His-Xaa5-Xaa1-Leu-Cys-Gly (SEQ ID NO:48);
Xaa6-Val-Cys-Cys-Ser-Asp-Xaa5-Arg-Cys-Asn-Val-Gly-His-Xaa5-Xaa1-Ile-Cys-Gly (SEQ ID NO:49);
Gly-Cys-Cys-Ser-Arg-Xaa5-Xaa5-Cys-Ile-Ala-Asn-Asn-Xaa5-Asp-Leu-Cys (SEQ ID NO:50);
Xaa5-Gln-Cys-Cys-Ser-His-Leu-Ala-Cys-Asn-Val-Asp-His-Xaa5-Xaa1-Ile-Cys-Arg (SEQ ID NO:51);
Gly-Cys-Cys-Ser-Xaa4-Phe-Asp-Cys-Arg-Met-Met-Phe-Xaa5-Xaa1-Met-Cys-Gly-Xaa3-Arg (SEQ ID NO:52);
Gly-Gly-Cys-Cys-Ser-Phe-Ala-Ala-Cys-Arg-Xaa2-Xaa4-Arg-Xaa5-Xaa1-Met-Cys-Gly (SEQ ID NO:53);
Gly-Gly-Cys-Cys-Phe-His-Xaa5-Val-Cys-Xaa4-Ile-Asn-Leu-Leu-Xaa1-Met-Cys-Arg-Gln-Arg (SEQ ID NO:54);

Ser-Ala-Thr-Cys-Cys-Asn-Xaa4-Xaa5-Xaa5-Cys-Xaa4-Xaa1-Thr-Xaa4-Xaa5-Xaa1-Ser-Cys-Leu (SEQ ID NO:55);
Ala-Cys-Cys-Ala-Xaa4-Xaa5-Xaa5-Cys-Phe-Xaa1-Ala-Xaa4-Xaa5-Xaa1-Arg-Cys-Leu (SEQ ID NO:56);
Asn-Ala-Xaa1-Cys-Cys-Xaa4-Xaa4-Xaa5-Xaa5-Cys-Xaa4-Xaa1-Ala-Xaa4-Xaa5-Xaa1-Ile-Cys-Leu (SEQ ID NO:57);
Xaa1-Cys-Cys-Thr-Asn-Xaa5-Val-Cys-His-Ala-Xaa1-His-Gln-Xaa1-Leu-Cys-Ala-Arg-Arg-Arg (SEQ ID NO:170);
Gly-Cys-Cys-Ser-Asn-Xaa5-Val-Cys-His-Leu-Xaa1-His-Ser-Asn-Leu-Cys (SEQ ID NO:171);
Xaa1-Cys-Cys-Thr-Asn-Xaa5-Val-Cys-His-Val-Xaa1-His-Gln-Xaa1-Leu-Cys-Ala-Arg-Arg-Arg (SEQ ID NO:172);
Xaa6-Xaa1-Cys-Cys-Ser-Xaa4-Xaa5-Ala-Cys-Asn-Leu-Asp-His-Xaa5-Xaa1-Leu-Cys (SEQ ID NO:173);
Xaa5-Xaa1-Cys-Cys-Ser-Asp-Xaa5-Arg-Cys-Asn-Ser-Thr-His-Xaa5-Xaa1-Leu-Cys-Gly (SEQ ID NO:174);
Leu-Asn-Cys-Cys-Met-Ile-Xaa5-Xaa5-Cys-Xaa3-Xaa2-Xaa2-Xaa4-Gly-Asp-Arg-Cys-Ser-Xaa1-Val-Arg (SEQ ID NO:175);
Ala-Phe-Gly-Cys-Cys-Asp-Leu-Ile-Xaa5-Cys-Leu-Xaa1-Arg-Xaa4-Gly-Asn-Arg-Cys-Asn-Xaa1-Val-His (SEQ ID NO:176);
Leu-Gly-Cys-Cys-Asn-Val-Thr-Xaa5-Cys-Xaa3-Xaa1-Xaa2-Xaa4-Gly-Asp-Xaa2-Cys-Asn-Xaa1-Val-Arg (SEQ ID NO:177);

Asp-Xaa[illegible]-Cys-Cys-Ser-Asn-Xaa5-Ala-Cys-Arg-Val-Asn-Asn-Xaa5-His-Val-Cys-Arg-Arg-Arg (SEQ ID NO:178);
Leu-Asn-Cys-Cys-Ser-Ile-Xaa5-Gly-Cys-Xaa3-Asn-Xaa1-Xaa4-Xaa2-Asp-Arg-Cys-Ser-Xaa2-Val-Arg (SEQ ID NO:179);

Gly-Gly-Cys-Cys-Ser-His-Xaa5-Val-Cys-Xaa4-Phe-Asn-Asn-Xaa5-Gln-Met-Cys-Arg (SEQ 
IDNO:180); 

5 Gly-Gly-Cys-Cys-Ser-His-Xaa5-Val-Cys-Asn-Leu-Asn-Asn-Xaa5-Gln-Met-Cys-Arg (SEQ 

IDNO:181); 

Gly-Cys-Cys-Ser-His-Xaa5-Xaa5-Cys-Xaa4-Ala-Asn-Asn-Gln-Ala-Xaa4-Cys-Asn (SEQ ID 
NO: 182); 

Gly-Gly-Cys-Cys-Ser-His-Xaa5-Ala-Cys-Ser-Val-Thr-His-Xaa5-Xaa,-Leu-Cys (SEQ ID 
10 NO:183); 

Gly-Gly-Cys-Cys-Ser-Xaa4-Xaa5-Ala-Cys-Ser-Val-Xaa,-His-Gln-Asp-Leu-Cys-Asp (SEQ 
ID NO: 184); 

Val-Ser-Cys-Cys-Val-Val-Arg-Xaa5-Cys-Xaa3-Ile-Arg-Xaa4-Gln-Xaa,-Xaa|-Cys-Leu-Xaa,- 
Ala-Asp-Xaa5-Arg-Thr-Leu (SEQ ID NO: 185); 
15 Xaa6-Asn-Cys-Cys-Ser-Ile-Xaa5-Gly-Cys-Xaa3-Xaa,-Xaa2-Xaa4-Gly-Asp-Xaa2-Cys-Ser- 
Xaa,-Val-Arg (SEQ ID NO: 186); 

Gly-Cys-Cys-Ser-Asn-Xaaj-Val-Cys-His-Leu-Xaa1-His-Xaa5-Asn-Ala-Cys (SEQ ID
NO:187);

Gly-Cys-Cys-Ser-Asn-Xaa5-Ile-Cys-Xaa4-Phe-Asn-Asn-Xaa5-Arg-Ile-Cys-Arg (SEQ ID
NO:188);

Xaa,-Cys-Cys-Ser-Gln-Xaa5-Xaa5-Cys-Arg-Xaa3-Xaa2-His-Xaa5-Xaa,-Leu-Cys-Ser (SEQ
ID NO:189);

Gly-Cys-Cys-Ser-His-Xaa5-Ala-Cys-Ala-Gly-Asn-Asn-Gln-His-Ile-Cys (SEQ ID NO:190);

Gly-Cys-Cys-Ala-Val-Xaa5-Ser-Cys-Arg-Leu-Arg-Asn-Xaa5-Asp-Leu-Cys-Gly-Gly (SEQ
ID NO:191);

Gly-Cys-Cys-Ser-His-Xaa5-Ala-Cys-Asn-Val-Asn-Asn-Xaa5-His-Ile-Cys (SEQ ID NO:192);

Thr-Xaa5-Xaa,-Xaa,-Cys-Cys-Xaa5-Asn-Xaa5-Xaa5-Cys-Phe-Ala-Thr-Asn-Ser-Asp-Ile-Cys-
Gly (SEQ ID NO:193);

Asp-Ala-Cys-Cys-Ser-Asp-Xaa5-Arg-Cys-Ser-Gly-Xaa2-His-Gln-Asp-Leu-Cys (SEQ ID
NO:194);

Xaa,-Asp-Cys-Cys-Ser-Asp-Xaa5-Arg-Cys-Ser-Val-Gly-His-Gln-Asp-Leu-Cys (SEQ ID
NO:195);

Gly-Cys-Cys-Ser-His-Xaaj-Ala-Cys-Ala-Gly-Ser-Asn-Ala-His-Ile-Cys (SEQ ID NO:196);

Xaa1-Asp-Cys-Cys-Ser-Asp-Xaaj-Arg-Cys-Ser-Val-Gly-His-Gln-Asp-Met-Cys (SEQ ID
NO:197);

Gly-Cys-Cys-Ser-His-Xaa5-Ala-Cys-Ala-Gly-Asn-Asn-Xaa5-His-Ile-Cys (SEQ ID NO:198);

Gly-Cys-Cys-Gly-Asn-Xaa5-Ser-Cys-Ser-Ile-His-Ile-Xaa5-Xaa4-Val-Cys-Asn (SEQ ID
NO:199);

Thr-Asp-Ser-Xaa,-Xaa1-Cys-Cys-Leu-Asp-Ser-Arg-Cys-Ala-Gly-Gln-His-Gln-Asp-Leu-
Cys-Gly (SEQ ID NO:200);

Gly-Cys-Cys-Ser-Asn-Xaa5-Xaa5-Cys-Xaa4-Ala-Asn-Asn-Gln-Ala-Xaa4-Cys-Asn (SEQ ID
NO:201);

Gly-Cys-Cys-Ser-His-Xaa5-Ala-Cys-Ser-Val-Asn-Asn-Xaa5-Asp-Ile-Cys (SEQ ID NO:202);

Gly-Xaa2-Cys-Cys-Ile-Asn-Asp-Ala-Cys-Arg-Ser-Xaa2-His-Xaa5-Gln-Xaa4-Cys-Ser (SEQ
ID NO:203);

Gly-Cys-Cys-Xaa4-Asn-Ile-Ala-Cys-Arg-Ile-Asn-Asn-Xaa5-Arg-Xaa4-Cys-Arg (SEQ ID
NO:204);

Gly-Cys-Cys-Ser-His-Xaa5-Val-Cys-Arg-Phe-Asn-Xaa4-Xaa5-Xaa2-Xaa4-Cys-Gly (SEQ ID
NO:205);

Asp-Xaa,-Cys-Cys-Ala-Ser-Xaa5-Xaa5-Cys-Arg-Leu-Asn-Asn-Xaa5-Xaa4-Val-Cys-His
(SEQ ID NO:206);

Gly-Cys-Cys-Ser-Asn-Xaa5-Val-Cys-Xaa3-Gln-Asn-Asn-Ala-Xaa1-Xaa4-Cys-Arg-Xaa1-Ser
(SEQ ID NO:207);

Gly-Cys-Cys-Ser-His-Xaa5-Xaa5-Cys-Ala-Gln-Asn-Asn-Gln-Asp-Xaa4-Cys (SEQ ID
NO:208);

Gly-Cys-Cys-Ser-His-Xaa5-Ala-Cys-Ser-Gly-Asn-Asn-Arg-Xaa,-Xaa4-Cys-Arg-Xaa1-Ser
(SEQ ID NO:209);

Asp-Xaa5-Cys-Cys-Ser-Xaa4-Xaa5-Asp-Cys-Gly-Ala-Asn-His-Xaa5-Xaa1-Ile-Cys-Gly (SEQ
ID NO:210);

Xaa1-Cys-Cys-Ser-Gln-Xaaj-Xaa5-Cys-Arg-Xaaa-Xaaj-His-Xaa5-Xaa1-Leu-Cys-Ser (SEQ
ID NO:211);

Gly-Cys-Cys-Ser-His-Xaa5-Ala-Cys-Ala-Gly-Asn-Asn-Xaa5-His-Ile-Cys (SEQ ID NO:212);

Gly-Cys-Cys-Ser-Asp-Xaa5-Ser-Cys-Asn-Val-Asn-Asn-Xaa5-Asp-Xaa4-Cys (SEQ ID
NO:213);

Xaa1-Xaa1-Cys-Cys-Ser-Asp-Xaa5-Arg-Cys-Ser-Val-Gly-His-Gln-Asp-Met-Cys-Arg (SEQ
ID NO:214);

Gly-Gly-Cys-Cys-Ser-Asn-Xaa5-Ala-Cys-Leu-Val-Asn-His-Leu-Xaa,-Met-Cys (SEQ ID
NO:215);

Arg-Asp-Xaa5-Cys-Cys-Phe-Asn-Xaa5-Ala-Cys-Asn-Val-Asn-Asn-Xaa5-Gln-Ile-Cys (SEQ
ID NO:216);

Cys-Cys-Ser-Asp-Xaaj-Ser-Cys-Xaaj-Arg-Leu-His-Ser-Leu-Ala-Cys-Thr-Gly-Ile-Val-Asn-
Arg (SEQ ID NO:217);

Cys-Cys-Thr-Asn-Xaa5-Ala-Cys-Leu-Val-Asn-Asn-Ile-Arg-Phe-Cys-Gly (SEQ ID NO:218);

Asp-Xaa,-Cys-Cys-Ser-Asp-Xaaj-Arg-Cys-His-Gly-Asn-Asn-Arg-Asp-His-Cys-Ala (SEQ
ID NO:219);

Asp-Cys-Cys-Ser-His-Xaa5-Leu-Cys-Arg-Leu-Phe-Val-Xaa5-Gly-Leu-Cys-Ile (SEQ ID
NO:220);

Gly-Cys-Cys-Ser-His-Xaa5-Val-Cys-Xaa2-Val-Arg-Xaa4-Xaa5-Asp-Leu-Cys-Arg (SEQ ID
NO:221);

Gly-Cys-Cys-Ser-His-Xaa5-Ala-Cys-Asn-Val-Asn-Asn-Xaa5-His-Ile-Cys (SEQ ID NO:222);

Gly-Cys-Cys-Ser-His-Xaa5-Val-Cys-Xaa2-Val-Arg-Xaa4-Ser-Asp-Met-Cys (SEQ ID
NO:223);

Gly-Gly-Cys-Cys-Ser-His-Xaa5-Ala-Cys-Xaa2-Val-His-Phe-Xaa5-His-Ser-Cys (SEQ ID
NO:224);

Val-Cys-Cys-Ser-Asn-Xaa5-Val-Cys-His-Val-Asp-His-Xaaj-Xaa1-Leu-Cys-Arg-Arg-Arg-
Arg (SEQ ID NO:225);

Gly-Cys-Cys-Ser-His-Xaa5-Val-Cys-Asn-Leu-Ser-Asn-Xaaj-Gln-Ile-Cys-Arg (SEQ ID
NO:226);

Xaa6-Xaa,-Cys-Cys-Ser-His-Xaa5-Ala-Cys-Asn-Val-Asp-His-Xaa5-Xaa,-Ile-Cys-Arg (SEQ
ID NO:227);

Gly-Cys-Cys-Ser-Asn-Xaa5-Ala-Cys-Leu-Val-Asn-His-Ile-Arg-Phe-Cys-Gly (SEQ ID
NO:228);

Asp-Cys-Cys-Asp-Asp-Xaa5-Ala-Cys-Thr-Val-Asn-Asn-Xaaj-Gly-Leu-Cys-Thr (SEQ ID
NO:229); and

TnAsn - X aa ^OCaa5*Cj'y-^Ile-Ald-Xaa2-A<!li-Xaa5-His-Mel-Cys^Tri7-Qly ^^
wherein Xaa1 is Glu or γ-carboxy-Glu (Gla); Xaa2 is Lys, N-methyl-Lys, N,N-dimethyl-Lys or
N,N,N-trimethyl-Lys; Xaa3 is Trp (D or L), halo-Trp or neo-Trp; Xaa4 is Tyr, nor-Tyr, mono-halo-
Tyr, di-halo-Tyr, O-sulpho-Tyr, O-phospho-Tyr or nitro-Tyr; and Xaa5 is Pro or hydroxy-Pro; Xaa6
is Gln or pyro-Glu; and the C-terminus contains a carboxyl or amide group. The halo is preferably
bromine, chlorine or iodine, more preferably iodine for Tyr and bromine for Trp. In addition, the
His residues may be substituted with halo-His; the Arg residues may be substituted by Lys,
ornithine, homoarginine, N-methyl-Lys, N,N-dimethyl-Lys, N,N,N-trimethyl-Lys or any unnatural
basic amino acid; the Lys residues may be substituted by Arg, ornithine, homoarginine, N-methyl-Lys,
N,N-dimethyl-Lys, N,N,N-trimethyl-Lys or any unnatural basic amino acid; the Tyr residues may
be substituted with any unnatural hydroxy containing amino acid; the Ser residues may be
substituted with Thr; the Thr residues may be substituted with Ser; and the Phe and Trp residues
may be substituted with any unnatural aromatic amino acid. The Cys residues may be in D or L
configuration and may optionally be substituted with homocysteine (D or L). The Tyr residues may
be substituted with the 3-hydroxyl or 2-hydroxyl isomers and corresponding O-sulpho- and O-
phospho-derivatives. The acidic amino acid residues may be substituted with any synthetic acidic
bioisoteric amino acid surrogate, e.g., tetrazolyl derivatives of Gly and Ala.

More specifically, the present invention is directed to the following α-conotoxin peptides
of general formula III:

SmI: SEQ ID NO:22, wherein Xaa1 is Glu and Xaa5 is Pro;
OB-29: SEQ ID NO:23, wherein Xaa1 is Glu, Xaa4 is Tyr and Xaa5 is Pro;
Tx1.1: SEQ ID NO:24, wherein Xaa1 is Glu and Xaa5 is Pro;
R1.1A: SEQ ID NO:25, wherein Xaa1 is Glu and Xaa5 is Pro;
R1.1B: SEQ ID NO:26, wherein Xaa1 is Glu and Xaa5 is Pro;
Om-9: SEQ ID NO:27, wherein Xaa1 is Glu and Xaa5 is Pro;
Om-10: SEQ ID NO:28, wherein Xaa5 is Pro;
Om-21: SEQ ID NO:29, wherein Xaa1 is Glu and Xaa5 is Pro;
Om-25: SEQ ID NO:30, wherein Xaa1 is Glu and Xaa5 is Pro;
Om-27: SEQ ID NO:31, wherein Xaa1 is Glu and Xaa5 is Pro;
Om-28: SEQ ID NO:32, wherein Xaa1 is Glu and Xaa5 is Pro;
Bt1.2: SEQ ID NO:33, wherein Xaa1 is Glu and Xaa5 is Pro;
Bt1.4: SEQ ID NO:34, wherein Xaa1 is Glu and Xaa5 is Pro;


Da1.1: SEQ ID NO:35, wherein Xaa1 is Glu and Xaa5 is Pro;
OB-20: SEQ
TI:
TIB: SEQ
rnl .1 : SEQ
rnl, 2: SEQ
T' 1 .
1 1 : SEQ
1 lA: SEQ
Da1.2: SEQ
Cr1.2: SEQ
Cl 1 ^ .
oil .2: SEQ
Tx1.3: SEQ
Da1.3: SEQ
Da1.4: SEQ
Tx1.2: SEQ
Om-35: SEQ
Sll.l: SEQ
Pro;
C 1 1 iC.
ISll.o: SEQ
Pro;
oil T .
S11.7: SEQ
Bt1.1: SEQ
Bt1.3: SEQ
Bt1.5: SEQ
A1.4: SEQ
A 1 C •
Ai.j. SEQ
A1.6: SEQ
Afl.l: SEQ
Gln;
Afl.2: SEQ
Ar1.2: SEQ
and Xaa5 is Pro;

Ar1.3: SEQ ID NO:176, wherein Xaa1 is Glu, Xaa4 is Tyr and Xaa5 is Pro;
Ar1.4: SEQ ID NO:177, wherein Xaa1 is Glu, Xaa2 is Lys, Xaa3 is Trp, Xaa4 is Tyr and Xaa5 is Pro;
Ar1.5: SEQ ID NO:178, wherein Xaa1 is Glu and Xaa5 is Pro;
Ar1.6: SEQ ID NO:179, wherein Xaa1 is Glu, Xaa2 is Lys, Xaa3 is Trp, Xaa4 is Tyr and Xaa5 is Pro;
Ay1.2: SEQ ID NO:180, wherein Xaa4 is Tyr and Xaa5 is Pro;
Ay1.3: SEQ ID NO:181, wherein Xaa5 is Pro;
Bn1.4: SEQ ID NO:182, wherein Xaa4 is Tyr and Xaa5 is Pro;
Bt1.8: SEQ ID NO:183, wherein Xaa1 is Glu and Xaa5 is Pro;
Bt1.9: SEQ ID NO:184, wherein Xaa1 is Glu, Xaa4 is Tyr and Xaa5 is Pro;
Ca1.3: SEQ ID NO:185, wherein Xaa1 is Glu, Xaa3 is Trp, Xaa4 is Tyr and Xaa5 is Pro;
Ca1.4: SEQ ID NO:186, wherein Xaa1 is Glu, Xaa2 is Lys, Xaa3 is Trp, Xaa4 is Tyr, Xaa5 is Pro and Xaa6 is Gln;
C1.2: SEQ ID NO:187, wherein Xaa1 is Glu and Xaa5 is Pro;
C1.3: SEQ ID NO:188, wherein Xaa4 is Tyr and Xaa5 is Pro;
Ep1.2: SEQ ID NO:189, wherein Xaa1 is Glu, Xaa2 is Lys, Xaa3 is Trp and Xaa5 is Pro;
G1.1: SEQ ID NO:190, wherein Xaa5 is Pro;
G1.3: SEQ ID NO:191, wherein Xaa5 is Pro;
Im1.3: SEQ ID NO:192, wherein Xaa5 is Pro;
Lv1.2: SEQ ID NO:193, wherein Xaa1 is Glu and Xaa5 is Pro;
Lv1.3: SEQ ID NO:194, wherein Xaa2 is Lys and Xaa5 is Pro;
Lv1.4: SEQ ID NO:195, wherein Xaa1 is Glu and Xaa5 is Pro;
Lv1.6: SEQ ID NO:196, wherein Xaa5 is Pro;
Lv1.7: SEQ ID NO:197, wherein Xaa1 is Glu and Xaa5 is Pro;
Lv1.8: SEQ ID NO:198, wherein Xaa5 is Pro;
Lv1.9: SEQ ID NO:199, wherein Xaa4 is Tyr and Xaa5 is Pro;
Lv1.10: SEQ ID NO:200, wherein Xaa1 is Glu;
Mr1.3: SEQ ID NO:201, wherein Xaa4 is Tyr and Xaa5 is Pro;
Mr1.4: SEQ ID NO:202, wherein Xaa5 is Pro;
Ms1.1: SEQ ID NO:203, wherein Xaa2 is Lys, Xaa4 is Tyr and Xaa5 is Pro;



MsLo: SEQ ID NO:204, wherein Xaa4 is Tyr and Xaa5 is Pro;
O1.1: SEQ ID NO:205, wherein Xaa2 is Lys, Xaa4 is Tyr and Xaa5 is Pro;
O1.2: SEQ ID NO:206, wherein Xaa1 is Glu, Xaa4 is Tyr and Xaa5 is Pro;
O1.4: SEQ ID NO:207, wherein Xaa1 is Glu, Xaa3 is Trp, Xaa4 is Tyr and Xaa5 is Pro;
O1.7: SEQ ID NO:208, wherein Xaa4 is Tyr and Xaa5 is Pro;
O1.8: SEQ ID NO:209, wherein Xaa1 is Glu, Xaa4 is Tyr and Xaa5 is Pro;
Om1.2: SEQ ID NO:210, wherein Xaa1 is Glu, Xaa4 is Tyr and Xaa5 is Pro;
Om1.3: SEQ ID NO:211, wherein Xaa1 is Glu, Xaa2 is Lys, Xaa3 is Trp and Xaa5 is Pro;
Om1.4: SEQ ID NO:212, wherein Xaa5 is Pro;
Om1.5: SEQ ID NO:213, wherein Xaa4 is Tyr and Xaa5 is Pro;
Om1.6: SEQ ID NO:214, wherein Xaa1 is Glu and Xaa5 is Pro;
P1.4: SEQ ID NO:215, wherein Xaa1 is Glu and Xaa5 is Pro;
P1.5: SEQ ID NO:216, wherein Xaa5 is Pro;
P1.6: SEQ ID NO:217, wherein Xaa3 is Trp and Xaa5 is Pro;
P1.8: SEQ ID NO:218, wherein Xaa5 is Pro;
Rg1.1: SEQ ID NO:219, wherein Xaa1 is Glu and Xaa5 is Pro;
Rg1.3: SEQ ID NO:220, wherein Xaa5 is Pro;
Rg1.4: SEQ ID NO:221, wherein Xaa2 is Lys, Xaa4 is Tyr and Xaa5 is Pro;
Rg1.5: SEQ ID NO:222, wherein Xaa5 is Pro;
Rg1.8: SEQ ID NO:223, wherein Xaa2 is Lys, Xaa4 is Tyr and Xaa5 is Pro;
Sm1.4: SEQ ID NO:224, wherein Xaa2 is Lys and Xaa5 is Pro;
Sm1.5: SEQ ID NO:225, wherein Xaa1 is Glu and Xaa5 is Pro;
S1.5: SEQ ID NO:226, wherein Xaa5 is Pro;
Tx1.5: SEQ ID NO:227, wherein Xaa1 is Glu, Xaa5 is Pro and Xaa6 is Gln;
T1.1: SEQ ID NO:228, wherein Xaa5 is Pro;
Vr1.3: SEQ ID NO:229, wherein Xaa5 is Pro; and
Tb: SEQ ID NO:230, wherein Xaa2 is Lys and Xaa5 is Pro.



The C-terminus preferably contains a carboxyl group for the peptides OB-29, Tx1.1, R1.1A, R1.1B,
Om-9, Om-10, Om-21, Om-25, Om-27, Om-28, Cr1.2, Om-35, Bt1.1, Bt1.3, Bt1.5, A1.4, A1.6,
Ar1.2, Ar1.3, Ar1.4, Ar1.5, Ar1.6, Ca1.3, Ca1.4, Ep1.2, Lv1.9, O1.2, Om1.3, Om1.6, P1.6, Rg1.1,
Rg1.3, Rg1.4, Sm1.5, Tx1.5 and Vr1.3. The C-terminus of the other peptides preferably contains
an amide group.
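
The relationship between a general formula and a specific peptide can be illustrated with a short Python sketch. It is offered only as an illustration under stated assumptions: the function name `expand_formula` and the dictionary representation of a "wherein" clause are hypothetical and are not part of the disclosure. The example takes SEQ ID NO:212 as listed above and applies the substitution recited for Om1.4 (Xaa5 is Pro).

```python
import re


def expand_formula(formula: str, assignments: dict) -> str:
    """Substitute residues for XaaN placeholders in a hyphen-separated formula.

    `assignments` maps placeholder names (e.g. "Xaa5") to three-letter residue
    codes, mirroring a "wherein Xaa5 is Pro" clause. Illustrative helper only.
    """
    residues = []
    for token in formula.split("-"):
        if re.fullmatch(r"Xaa\d+", token):
            # Every placeholder in the formula must be given a residue;
            # a missing entry raises KeyError rather than guessing.
            residues.append(assignments[token])
        else:
            residues.append(token)
    return "-".join(residues)


# SEQ ID NO:212 as listed above; Om1.4 is this formula wherein Xaa5 is Pro.
SEQ_ID_NO_212 = "Gly-Cys-Cys-Ser-His-Xaa5-Ala-Cys-Ala-Gly-Asn-Asn-Xaa5-His-Ile-Cys"
om1_4 = expand_formula(SEQ_ID_NO_212, {"Xaa5": "Pro"})
print(om1_4)
```

The sketch handles only unmodified three-letter residues; a modified residue whose name contains a hyphen (for example hydroxy-Pro) would need a different separator or data structure.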

The present invention is also directed to the novel specific α-conotoxin peptides having the
formulas:

Cys-Cys-Thr-Ile-Xaa5-Ser-Cys-Xaa4-Xaa,-Xaa2-Xaa2-Xaa2-Ile-Xaa2-Ala-Cys-Val-Phe (SEQ
ID NO:231) and

Gly-Cys-Cys-Gly-Asn-Xaa5-Ala-Cys-Ser-Gly-Ser-Ser-Xaa2-Asp-Ala-Xaa5-Ser-Cys (SEQ
ID NO:232),

wherein Xaa1 is Glu or γ-carboxy-Glu (Gla); Xaa2 is Lys, N-methyl-Lys, N,N-dimethyl-Lys or
N,N,N-trimethyl-Lys; Xaa4 is Tyr, nor-Tyr, mono-halo-Tyr, di-halo-Tyr, O-sulpho-Tyr, O-phospho-
Tyr or nitro-Tyr; and Xaa5 is Pro or hydroxy-Pro; and the C-terminus contains a carboxyl or amide
group. The halo is preferably bromine, chlorine or iodine, more preferably iodine for Tyr. In
addition, the His residues may be substituted with halo-His; the Arg residues may be substituted by
Lys, ornithine, homoarginine, N-methyl-Lys, N,N-dimethyl-Lys, N,N,N-trimethyl-Lys or any
unnatural basic amino acid; the Lys residues may be substituted by Arg, ornithine, homoarginine,
N-methyl-Lys, N,N-dimethyl-Lys, N,N,N-trimethyl-Lys or any unnatural basic amino acid; the Tyr
residues may be substituted with any unnatural hydroxy containing amino acid; the Ser residues may
be substituted with Thr; the Thr residues may be substituted with Ser; and the Phe residues may be
substituted with any unnatural aromatic amino acid. The Cys residues may be in D or L
configuration and may optionally be substituted with homocysteine (D or L). The Tyr residues may
be substituted with the 3-hydroxyl or 2-hydroxyl isomers and corresponding O-sulpho- and O-
phospho-derivatives. The acidic amino acid residues may be substituted with any synthetic acidic
bioisoteric amino acid surrogate, e.g., tetrazolyl derivatives of Gly and Ala.

More specifically, the present invention is directed to the following α-conotoxin peptides:

G1.2: SEQ ID NO:231, wherein Xaa1 is Glu, Xaa2 is Lys, Xaa4 is Tyr and Xaa5 is Pro; and
Rg1.12: SEQ ID NO:232, wherein Xaa2 is Lys and Xaa5 is Pro.

The C-terminus of G1.2 preferably contains a carboxyl group, and the C-terminus of Rg1.12
preferably contains an amide group.

Examples of unnatural aromatic amino acid include, but are not limited to, such as nitro-Phe,
4-substituted-Phe wherein the substituent is C1-C3 alkyl, carboxyl, hydroxymethyl, sulphomethyl,
halo, phenyl, -CHO, -CN, -SO3H and -NHAc. Examples of unnatural hydroxy containing amino
acid, include, but are not limited to, such as 4-hydroxymethyl-Phe, 4-hydroxyphenyl-Gly, 2,6-
dimethyl-Tyr and 5-amino-Tyr. Examples of unnatural basic amino acids include, but are not
limited to, N-1-(2-pyrazolinyl)-Arg, 2-(4-piperinyl)-Gly, 2-(4-piperinyl)-Ala, 2-[3-(2S)pyrrolininyl)-
Gly and 2-[3-(2S)pyrrolininyl)-Ala. These and other unnatural basic amino acids, unnatural
hydroxy containing amino acids or unnatural aromatic amino acids are described in Building Block
Index, Version 3.0 (1999 Catalog, pages 4-47 for hydroxy containing amino acids and aromatic
amino acids and pages 66-87 for basic amino acids; see also http://www.amino-acids.com),
incorporated herein by reference, by and available from RSP Amino Acid Analogues, Inc.,




described above, the Asn residues may be modified to contain an N-glycan and the Ser and Thr
residues may be modified to contain an O-glycan. In accordance with the present invention, a
glycan shall mean any N-, S- or O-linked mono-, di-, tri-, poly- or oligosaccharide that can be
attached to any hydroxy, amino or thiol group of natural or modified amino acids by synthetic or
enzymatic methodologies known in the art. The monosaccharides making up the glycan can include
D-allose, D-altrose, D-glucose, D-mannose, D-gulose, D-idose, D-galactose, D-talose, D-
galactosamine, D-glucosamine, D-N-acetyl-glucosamine (GlcNAc), D-N-acetyl-galactosamine
(GalNAc), D-fucose or D-arabinose. These saccharides may be structurally modified, e.g., with one
or more O-sulfate, O-phosphate, O-acetyl or acidic groups, such as sialic acid, including
combinations thereof. The glycan may also include similar polyhydroxy groups, such as D-
penicillamine 2,5 and halogenated derivatives thereof or polypropylene glycol derivatives. The
glycosidic linkage is beta and 1-4 or 1-3, preferably 1-3. The linkage between the glycan and the
amino acid may be alpha or beta, preferably alpha and is 1-.

Core O-glycans have been described by Van de Steen et al. (1998), incorporated herein by
reference. Mucin type O-linked oligosaccharides are attached to Ser or Thr (or other hydroxylated
residues of the present peptides) by a GalNAc residue. The monosaccharide building blocks and
the linkage attached to this first GalNAc residue define the "core glycans," of which eight have been
identified. The type of glycosidic linkage (orientation and connectivities) is defined for each core
glycan. Suitable glycans and glycan analogs are described further in U.S. Serial No. 09/420,797,
filed 19 October 1999 and in PCT Application No. PCT/US99/24380, filed 19 October 1999, both
incorporated herein by reference. A preferred glycan is Gal(β1→3)GalNAc(α1→).

Optionally, in the peptides of general formulas I and II and the specific peptides described
above, pairs of Cys residues may be replaced pairwise with Ser/(Glu or Asp) or Lys/(Glu or Asp)
combinations. Sequential coupling by known methods (Barnay et al., 2000; Hruby et al., 1994;
Bitan et al., 1997) allows replacement of native Cys bridges with lactam bridges.

The present invention is further directed to propeptides and nucleic acid sequences encoding 
the propeptides or peptides as described in further detail herein. 



DETAILED DESCRIPTION OF THE INVENTION 

The invention relates to relatively short peptides (termed α-conotoxins herein), about 10-30
residues in length, which are naturally available in minute amounts in the venom of the cone snails
or analogous to the naturally available peptides, and which preferably include two disulfide bonds.

an effective amount of an α-conotoxin peptide. Such a pharmaceutical composition has the
capability of acting as antagonists for nicotinic acetylcholine receptors. In one aspect, the α-
conotoxins with specificity for neuromuscular junction nicotinic acetylcholine receptors are used
as neuromuscular blocking agents for use in conjunction with surgery, as disclosed in U.S. patent
application Serial No. 09/ , filed 21 January 2000 (Attorney Docket No. 2314-178.A) and
international patent application No. PCT/US00/ , filed 21 January 2000 (Attorney Docket No.
2314-138.PCT), each incorporated by reference herein. In a second aspect, additional α-conotoxins
and uses for them have been described in U.S. Patent Nos. 4,447,356 (Olivera et al., 1984);
5,432,155; 5,514,774, each incorporated herein by reference.

In a third aspect, additional uses for α-conotoxins are described in U.S. Serial No.
09/219,446, filed 22 December 1998, incorporated herein by reference. In this application, α-
conotoxins with specificity for neuronal nicotinic acetylcholine receptors are used for treating
disorders regulated at neuronal nicotinic acetylcholine receptors. Such disorders include, but are
not limited to, cardiovascular disorders, gastric motility disorders, urinary incontinence, nicotine
addiction, mood disorders (such as bipolar disorder, unipolar depression, dysthymia and seasonal
affective disorder) and small cell lung carcinoma, as well as the localization of small cell lung
carcinoma.

The α-conotoxin peptides described herein are sufficiently small to be chemically
synthesized. General chemical syntheses for preparing the foregoing α-conotoxin peptides are
described hereinafter. Various ones of the α-conotoxin peptides can also be obtained by isolation
and purification from specific Conus species using the technique described in U.S. Patent No.
4,447,356 (Olivera et al., 1984), the disclosure of which is incorporated herein by reference.

Although the α-conotoxin peptides of the present invention can be obtained by purification
from cone snails, because the amounts of α-conotoxin peptides obtainable from individual snails
are very small, the desired substantially pure α-conotoxin peptides are best practically obtained in
commercially valuable amounts by chemical synthesis using solid-phase strategy. For example, the
yield from a single cone snail may be about 10 micrograms or less of α-conotoxin peptide. By
"substantially pure" is meant that the peptide is present in the substantial absence of other biological
molecules of the same type; it is preferably present in an amount of at least about 85% purity and
preferably at least about 95% purity. Chemical synthesis of biologically active α-conotoxin peptides
depends of course upon correct determination of the amino acid sequence.

The α-conotoxin peptides can also be produced by recombinant DNA techniques well known
in the art. Such techniques are described by Sambrook et al. (1989). The peptides produced in this
manner are isolated, reduced if necessary, and oxidized to form the correct disulfide bonds.

One method of forming disulfide bonds in the α-conotoxin peptides of the present invention
is the air oxidation of the linear peptides for prolonged periods under cold room temperatures or at
room temperature. This procedure results in the creation of a substantial amount of the bioactive,
disulfide-linked peptides. The oxidized peptides are fractionated using reverse-phase high
performance liquid chromatography (HPLC) or the like, to separate peptides having different linked
configurations. Thereafter, either by comparing these fractions with the elution of the native material
or by using a simple assay, the particular fraction having the correct linkage for maximum biological
potency is easily determined. However, because of the dilution resulting from the presence of other
fractions of less biopotency, a somewhat higher dosage may be required.
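
The "different linked configurations" produced by oxidation can be enumerated for a given sequence. The following Python sketch is illustrative only: the helper names `cys_positions` and `disulfide_pairings` are hypothetical, and the example sequence is SEQ ID NO:212 with Xaa5 taken as Pro. For a peptide with four Cys residues and two disulfide bonds it lists the three possible pairings; which of them is the bioactive form is determined experimentally as described above, not by the code.

```python
def cys_positions(sequence: str) -> list:
    """Return 1-based positions of Cys in a hyphen-separated sequence."""
    return [i for i, residue in enumerate(sequence.split("-"), start=1)
            if residue == "Cys"]


def disulfide_pairings(positions: list) -> list:
    """Return every way to pair up the given Cys positions.

    Each pairing is a list of (i, j) tuples. An odd number of positions
    cannot be fully paired, so the result is then an empty list.
    """
    if not positions:
        return [[]]
    first, rest = positions[0], positions[1:]
    pairings = []
    for i, partner in enumerate(rest):
        remaining = rest[:i] + rest[i + 1:]
        for tail in disulfide_pairings(remaining):
            pairings.append([(first, partner)] + tail)
    return pairings


# Illustrative input: SEQ ID NO:212 with Xaa5 = Pro.
EXAMPLE = "Gly-Cys-Cys-Ser-His-Pro-Ala-Cys-Ala-Gly-Asn-Asn-Pro-His-Ile-Cys"
isomers = disulfide_pairings(cys_positions(EXAMPLE))
for bonds in isomers:
    print(bonds)
```

Each printed pairing corresponds to one candidate fraction that the HPLC step would need to resolve.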

The peptides are synthesized by a suitable method, such as by exclusively solid-phase
techniques, by partial solid-phase techniques, by fragment condensation or by classical solution
couplings.

In conventional solution phase peptide synthesis, the peptide chain can be prepared by a
series of coupling reactions in which constituent amino acids are added to the growing peptide chain
in the desired sequence. Use of various coupling reagents, e.g., dicyclohexylcarbodiimide or
diisopropylcarbonyldimidazole, various active esters, e.g., esters of N-hydroxyphthalimide or N-
hydroxy-succinimide, and the various cleavage reagents, to carry out reaction in solution, with
subsequent isolation and purification of intermediates, is well known classical peptide methodology.
Classical solution synthesis is described in detail in the treatise, "Methoden der Organischen Chemie
(Houben-Weyl): Synthese von Peptiden," (1974). Techniques of exclusively solid-phase synthesis
are set forth in the textbook, "Solid-Phase Peptide Synthesis," (Stewart and Young, 1969), and are
exemplified by the disclosure of U.S. Patent 4,105,603 (Vale et al., 1978). The fragment
condensation method of synthesis is exemplified in U.S. Patent 3,972,859 (1976). Other available
syntheses are exemplified by U.S. Patents No. 3,842,067 (1974) and 3,862,925 (1975). The
synthesis of peptides containing γ-carboxyglutamic acid residues is exemplified by Rivier et al.
(1987), Nishiuchi et al. (1993) and Zhou et al. (1996).

Common to such chemical syntheses is the protection of the labile side chain groups of the
various amino acid moieties with suitable protecting groups which will prevent a chemical reaction
from occurring at that site until the group is ultimately removed. Usually also common is the
protection of an α-amino group on an amino acid or a fragment while that entity reacts at the
carboxyl group, followed by the selective removal of the α-amino protecting group to allow
subsequent reaction to take place at that location. Accordingly, it is common that, as a step in such
a synthesis, an intermediate compound is produced which includes each of the amino acid residues
located in its desired sequence in the peptide chain with appropriate side-chain protecting groups
linked to various ones of the residues having labile side chains.

As far as the selection of a side chain amino protecting group is concerned, generally one
is chosen which is not removed during deprotection of the α-amino groups during the synthesis.
However, for some amino acids, e.g., His, protection is not generally necessary. In selecting a
particular side chain protecting group to be used in the synthesis of the peptides, the following
general rules are followed: (a) the protecting group preferably retains its protecting properties and
is not split off under coupling conditions, (b) the protecting group should be stable under the
reaction conditions selected for removing the α-amino protecting group at each step of the synthesis,
and (c) the side chain protecting group must be removable, upon the completion of the synthesis
containing the desired amino acid sequence, under reaction conditions that will not undesirably alter
the peptide chain.

It should be possible to prepare many, or even all, of these peptides using recombinant DNA
technology. However, when peptides are not so prepared, they are preferably prepared using the
Merrifield solid-phase synthesis, although other equivalent chemical syntheses known in the art can
also be used as previously mentioned. Solid-phase synthesis is commenced from the C-terminus
of the peptide by coupling a protected α-amino acid to a suitable resin. Such a starting material can
be prepared by attaching an α-amino-protected amino acid by an ester linkage to a chloromethylated
resin or a hydroxymethyl resin, or by an amide bond to a benzhydrylamine (BHA) resin or
para-methylbenzhydrylamine (MBHA) resin. Preparation of the hydroxymethyl resin is described by
Bodansky et al. (1966). Chloromethylated resins are commercially available from Bio Rad
Laboratories (Richmond, CA) and from Lab. Systems, Inc. The preparation of such a resin is
described by Stewart and Young (1969). BHA and MBHA resin supports are commercially
available, and are generally used when the desired polypeptide being synthesized has an
unsubstituted amide at the C-terminus. Thus, solid resin supports may be any of those known in the
art, such as one having the formulae -O-CH2-resin support, -NH-BHA resin support, or -NH-MBHA
resin support. When the unsubstituted amide is desired, use of a BHA or MBHA resin is preferred,
because cleavage directly gives the amide. In case the N-methyl amide is desired, it can be
generated from an N-methyl BHA resin. Should other substituted amides be desired, the teaching
of U.S. Patent No. 4,569,967 (Kornreich et al., 1986) can be used, or should still other groups than
the free acid be desired at the C-terminus, it may be preferable to synthesize the peptide using
classical methods as set forth in the Houben-Weyl text (1974).
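
The resin choice and the direction of chain assembly described above can be summarized in a short Python sketch. It is a simplified illustration under stated assumptions, not a synthesis protocol: the function names `choose_resin` and `coupling_order` are hypothetical, only the free-acid and unsubstituted-amide cases from the text are covered, and the example sequence is SEQ ID NO:212 with Xaa5 taken as Pro.

```python
def choose_resin(c_terminus: str) -> str:
    """Map the desired C-terminus to the resin families named in the text."""
    if c_terminus == "amide":
        # Cleavage from BHA or MBHA resin directly gives the unsubstituted amide.
        return "BHA or MBHA resin"
    if c_terminus == "acid":
        # Ester linkage to the resin, used when a free acid is wanted.
        return "chloromethylated or hydroxymethyl resin"
    raise ValueError("c_terminus must be 'amide' or 'acid'")


def coupling_order(sequence: str) -> list:
    """List residues in the order they are attached, C-terminal residue first."""
    return sequence.split("-")[::-1]


# Illustrative input: SEQ ID NO:212 with Xaa5 = Pro, assumed here to be
# wanted as the C-terminal amide.
EXAMPLE = "Gly-Cys-Cys-Ser-His-Pro-Ala-Cys-Ala-Gly-Asn-Asn-Pro-His-Ile-Cys"
print(choose_resin("amide"))
print(coupling_order(EXAMPLE))
```

The first element of the returned list is the residue attached to the resin; the last is the N-terminal residue, coupled at the end of the synthesis.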

The C-terminal amino acid, protected by Boc or Fmoc and by a side-chain protecting group,
if appropriate, can be first coupled to a chloromethylated resin according to the procedure set forth
in K. Horiki et al. (1978), using KF in DMF at about 60°C for 24 hours with stirring, when a peptide
having free acid at the C-terminus is to be synthesized. Following the coupling of the Boc-
protected amino acid to the resin support, the α-amino protecting group is removed, as by using
trifluoroacetic acid (TFA) in methylene chloride or TFA alone. The deprotection is carried out at
a temperature between about 0°C and room temperature. Other standard cleaving reagents, such as
HCl in dioxane, and conditions for removal of specific α-amino protecting groups may be used as
described in Schroder & Lubke (1965).

After removal of the α-amino-protecting group, the remaining α-amino- and side chain-
protected amino acids are coupled step-wise in the desired order to obtain the intermediate
compound defined hereinbefore, or as an alternative to adding each amino acid separately in the
synthesis, some of them may be coupled to one another prior to addition to the solid phase reactor.
Selection of an appropriate coupling reagent is within the skill of the art. Particularly suitable as
a coupling reagent is N,N'-dicyclohexylcarbodiimide (DCC, DIC, HBTU, HATU, TBTU in the
presence of HoBt or HoAt).

The activating reagents used in the solid phase synthesis of the peptides are well known in
the peptide art. Examples of suitable activating reagents are carbodiimides, such as N,N'-
diisopropylcarbodiimide and N-ethyl-N'-(3-dimethylaminopropyl)carbodiimide. Other activating
reagents and their use in peptide coupling are described by Schroder & Lubke (1965) and Kapoor
(1970).

Each protected amino acid or amino acid sequence is introduced into the solid-phase reactor
in about a twofold or more excess, and the coupling may be carried out in a medium of
dimethylformamide (DMF):CH2Cl2 (1:1) or in DMF or CH2Cl2 alone. In cases where intermediate
coupling occurs, the coupling procedure is repeated before removal of the α-amino protecting group
prior to the coupling of the next amino acid. The success of the coupling reaction at each stage of
the synthesis, if performed manually, is preferably monitored by the ninhydrin reaction, as described
by Kaiser et al. (1970). Coupling reactions can be performed automatically, as on a Beckman 990
automatic synthesizer, using a program such as that reported in Rivier et al. (1978).
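
The "twofold or more excess" mentioned above translates into a simple quantity calculation. The Python sketch below is a worked-arithmetic illustration only: the function name `protected_amino_acid_mmol` is hypothetical, and the resin mass (0.5 g) and substitution level (0.6 mmol/g) are assumed example values that do not come from this document.

```python
def protected_amino_acid_mmol(resin_g: float,
                              loading_mmol_per_g: float,
                              excess: float = 2.0) -> float:
    """Millimoles of protected amino acid to charge for one coupling.

    resin_g            -- mass of resin in the reactor (assumed value)
    loading_mmol_per_g -- substitution level of the resin (assumed value)
    excess             -- molar excess over resin sites; the text calls for
                          about twofold or more
    """
    if resin_g <= 0 or loading_mmol_per_g <= 0 or excess <= 0:
        raise ValueError("all quantities must be positive")
    reactive_sites_mmol = resin_g * loading_mmol_per_g
    return reactive_sites_mmol * excess


# Hypothetical example: 0.5 g of resin at 0.6 mmol/g gives 0.3 mmol of
# sites, so a twofold excess corresponds to about 0.6 mmol per coupling.
needed = protected_amino_acid_mmol(0.5, 0.6)
print(f"{needed:.2f} mmol per coupling")
```

If a coupling has to be repeated, the same amount is charged again for the repeat.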

After the desired amino acid sequence has been completed, the intermediate peptide can be
removed from the resin support by treatment with a reagent, such as liquid hydrogen fluoride or
TFA (if using Fmoc chemistry), which not only cleaves the peptide from the resin but also cleaves
all remaining side chain protecting groups and also the α-amino protecting group at the N-terminus
if it was not previously removed to obtain the peptide in the form of the free acid. If Met is present
in the sequence, the Boc protecting group is preferably first removed using trifluoroacetic acid
(TFA)/ethanedithiol prior to cleaving the peptide from the resin with HF to eliminate potential S-
alkylation. When using hydrogen fluoride or TFA for cleaving, one or more scavengers such as
anisole, cresol, dimethyl sulfide and methylethyl sulfide are included in the reaction vessel.

Cyclization of the linear peptide is preferably effected, as opposed to cyclizing the peptide
while a part of the peptido-resin, to create bonds between Cys residues. To effect such a disulfide
cyclizing linkage, fully protected peptide can be cleaved from a hydroxymethylated resin or a
chloromethylated resin support by ammonolysis, as is well known in the art, to yield the fully
protected amide intermediate, which is thereafter suitably cyclized and deprotected. Alternatively,
deprotection, as well as cleavage of the peptide from the above resins or a benzhydrylamine (BHA)
resin or a methylbenzhydrylamine (MBHA), can take place at 0°C with hydrofluoric acid (HF) or
TFA, followed by oxidation as described above.

The peptides are also synthesized using an automatic synthesizer. Amino acids are
C-terminus using an Advanced Chemtech 357 Automatic Peptide Synthesizer. Couplings are carried
out using 1,3-diisopropylcarbodiimide in N-methylpyrrolidinone (NMP) or by 2-(1H-benzotriazole-
1-yl)-1,1,3,3-tetramethyluronium hexafluorophosphate (HBTU) and diethylisopropylethylamine
(DIEA). The FMOC protecting group is removed by treatment with a 20% solution of piperidine
in dimethylformamide (DMF). Resins are subsequently washed with DMF (twice), followed by

Pharmaceutical compositions containing a compound of the present invention or its 
pharmaceutically acceptable salts as the active ingredient can be prepared according to conventional
pharmaceutical compounding techniques. See, for example, Remington's Pharmaceutical Sciences, 
18th Ed. (1990, Mack Publishing Co., Easton, PA). Typically, an antagonistic amount of the active 
ingredient will be admixed with a pharmaceutically acceptable carrier. The carrier may take a wide 
variety of forms depending on the form of preparation desired for administration, e.g., intravenous, 
oral or parenteral. The compositions may further contain antioxidizing agents, stabilizing agents, 
preservatives and the like. 

For oral administration, the compounds can be formulated into solid or liquid preparations 
such as capsules, pills, tablets, lozenges, melts, powders, suspensions or emulsions. In preparing 
the compositions in oral dosage form, any of the usual pharmaceutical media may be employed, 
such as, for example, water, glycols, oils, alcohols, flavoring agents, preservatives, coloring agents, 
suspending agents, and the like in the case of oral liquid preparations (such as, for example, 
suspensions, elixirs and solutions); or carriers such as starches, sugars, diluents, granulating agents, 
lubricants, binders, disintegrating agents and the like in the case of oral solid preparations (such as, 
for example, powders, capsules and tablets). Because of their ease in administration, tablets and 
capsules represent the most advantageous oral dosage unit form, in which case solid pharmaceutical 
carriers are obviously employed. If desired, tablets may be sugar-coated or enteric-coated by 
standard techniques. The active agent can be encapsulated to make it stable to passage through the 
gastrointestinal tract while at the same time allowing for passage across the blood brain barrier. See,
for example, WO 96/11698.

For parenteral administration, the compound may be dissolved in a pharmaceutical carrier 
and administered as either a solution or a suspension. Illustrative of suitable carriers are water, 
saline, dextrose solutions, fructose solutions, ethanol, or oils of animal, vegetative or synthetic 
origin. The carrier may also contain other ingredients, for example, preservatives, suspending
agents, solubilizing agents, buffers and the like. When the compounds are being administered
intrathecally, they may also be dissolved in cerebrospinal fluid.

The actual amount administered, and the rate and time-course of administration, will depend on the
nature and severity of the condition being treated. Prescription of treatment, e.g. decisions on
dosage, timing, etc., is within the responsibility of general practitioners or specialists, and typically
takes account of the disorder to be treated, the condition of the individual patient, the site of
delivery, the method of administration and other factors known to practitioners. Examples of
techniques and protocols can be found in Remington's Pharmaceutical Sciences. Typically the
conopeptides of the present invention exhibit their effect at a dosage range from about 0.001 mg/kg
to about 250 mg/kg, preferably from about 0.05 mg/kg to about 100 mg/kg of the active ingredient,
more preferably from about 0.1 mg/kg to about 75 mg/kg. A suitable dose can be administered in
multiple sub-doses per day. Typically, a dose or sub-dose may contain from about 0.1 mg to about
500 mg of the active ingredient per unit dosage form. A more preferred dosage will contain from
about [illegible] mg to about 100 mg of active ingredient per unit dosage form. Dosages are generally

Alternatively, targeting therapies may be used to deliver the active agent more specifically 
to certain types of cell, by the use of targeting systems such as antibodies or cell specific ligands. 
Targeting may be desirable for a variety of reasons, e.g. if the agent is unacceptably toxic, or if it 
would otherwise require too high a dosage, or if it would not otherwise be able to enter the target 
cells. 

The active agents, which are peptides, can also be administered in a cell based delivery 
system in which a DNA sequence encoding an active agent is introduced into cells designed for 
implantation in the body of the patient, especially in the spinal cord region. Suitable delivery 
systems are described in U.S. Patent No. 5,550,050 and published PCT Application Nos. WO 
92/19195, WO 94/25503, WO 95/01203, WO 95/05452, WO 96/02286, WO 96/02646, WO 
96/40871, WO 96/40959 and WO 97/12635. Suitable DNA sequences can be prepared synthetically
for each active agent on the basis of the developed sequences and the known genetic code. 
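
The preparation of a coding sequence from a peptide sequence and the known genetic code can be illustrated by the following minimal sketch. It is offered only by way of illustration: the function names are hypothetical, the codon selected for each residue (the first codon listed for that residue in the standard genetic code) is an arbitrary assumption, and no codon-usage optimization for any host cell is implied. The example peptide is the 16-residue stretch Gly-Cys-Cys-...-Leu-Cys read from the translation in Table 1.

```python
from itertools import product

# Standard genetic code, codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# ASSUMPTION: use the first codon encountered for each residue (arbitrary choice).
FIRST_CODON = {}
for codon, aa in CODON_TABLE.items():
    FIRST_CODON.setdefault(aa, codon)


def back_translate(peptide: str) -> str:
    """Return one possible DNA sequence (with stop codon) encoding the peptide."""
    return "".join(FIRST_CODON[aa] for aa in peptide.upper()) + FIRST_CODON["*"]


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3].lower()]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


peptide = "GCCSNPVCHLEHSNLC"  # one-letter form of the Table 1 stretch
dna = back_translate(peptide)
print(dna)
print(translate(dna) == peptide)  # round-trip check
```

Because the genetic code is degenerate, many other DNA sequences encode the same peptide; the round-trip check only confirms that the sequence produced here is one of them.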
EXAMPLES

The present invention is described by reference to the following Examples, which are offered 
by way of illustration and are not intended to limit the invention in any manner. Standard
techniques well known in the art or the techniques specifically described below were utilized.

EXAMPLE 1

Isolation of α-Conotoxins
Crude venom was extracted from venom ducts (Cruz et al., 1976), and the components were
purified as previously described (Cartier et al., 1996a). The crude extract from venom ducts was
purified by reverse phase liquid chromatography (RPLC) using a Vydac C18 semi-preparative
column (10 x 250 mm) and elution with a linear gradient of acetonitrile in 0.1% TFA. Further
purification of bioactive peaks was done on a Vydac C18 analytical column (4.6 x 220 mm) eluted
with a gradient of acetonitrile in 0.1% TFA. The effluents were monitored at 220 nm. Peaks were
collected, and aliquots were assayed for activity. Activity was monitored by assessing block of
α3β4 nAChRs expressed in Xenopus oocytes.
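
A linear gradient of the kind referred to above can be described by a simple interpolation between a starting and a final solvent composition. The sketch below is illustrative only: the function name and the example start composition, end composition and run time are hypothetical values chosen for the demonstration, as the text does not state the gradient parameters actually used.

```python
def percent_b(t_min: float, start_pct: float, end_pct: float, duration_min: float) -> float:
    """Percent of the organic solvent (e.g. acetonitrile in 0.1% TFA) at time t
    for a linear gradient; the composition is held constant outside the run."""
    if duration_min <= 0:
        raise ValueError("duration_min must be positive")
    fraction = min(max(t_min / duration_min, 0.0), 1.0)  # clamp to [0, 1]
    return start_pct + (end_pct - start_pct) * fraction


# HYPOTHETICAL parameters: 0% to 60% over 60 minutes (not taken from the text).
for t in (0, 15, 30, 60):
    print(t, percent_b(t, 0.0, 60.0, 60.0))
```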

The amino acid sequences of the purified peptides were determined by standard methods.
The purified peptides were reduced and alkylated prior to sequencing by automated Edman
degradation on an Applied Biosystems 477A Protein Sequencer with a 120A Analyzer
(DNA/Peptide Facility, University of Utah) (Martinez et al., 1995; Shon et al., 1994).

In accordance with this method, peptides MII, AuIA, AuIB, AuIC, MAR-1, MAR-2, TI, OB-29,
EpI, S1.1, Bn1.1, Bn1.2, Ca1.1, Ca1.2, Cn1.1, Cn1.2 and Sm1.3 were obtained.






EXAMPLE 2 
Synthesis of Conopeptides 
The synthesis of conopeptides, either the mature toxins or the precursor peptides, was separately
performed using conventional protection chemistry as described by Cartier et al. (1996).
Briefly, the linear chains were built on Rink amide resin by Fmoc procedures with 2-(1H-benzotriazol-
1-yl)-1,1,3,3-tetramethyluronium tetrafluoroborate coupling using an ABI model 430A peptide
synthesizer with amino acid derivatives purchased from Bachem (Torrance, CA). Orthogonal
protection was used on cysteines: two Cys residues (residue numbers illegible) were protected as the
stable Cys(S-acetamidomethyl), while the other two Cys residues (residue numbers illegible) were
protected as the acid-labile Cys(S-trityl). After removal of the terminal Fmoc protecting group and
cleavage of the peptides from the resins, the released peptides were precipitated by filtering the
reaction mixture into -10°C methyl t-butyl ether,
which removed the protecting groups except on the two Cys(S-acetamidomethyl) residues. The
peptides were dissolved in 0.1% TFA and 60% acetonitrile and purified by RPLC on a Vydac C18
preparative column (22 x 250 mm).

The disulfide bridges in the conopeptides were formed as described in Cartier et al.
(1996). Briefly, the disulfide bridges between the two deprotected Cys residues were formed by air
oxidation which was judged to be complete by analytical RPLC. The monocyclic peptides were
purified by RPLC on a Vydac C18 preparative column (22 x 250 mm) and eluted with a gradient of
acetonitrile in 0.1% TFA. Removal of S-acetamidomethyl groups and closure of the disulfide bridge
between the remaining two Cys residues was carried out simultaneously by iodine oxidation. The
cyclic peptides were purified by RPLC on a Vydac C18 preparative column (22 x 250 mm) and
eluted with a gradient of acetonitrile in 0.1% TFA.





EXAMPLE 3 
Isolation of DNA Encoding a-Conotoxins 
DNA coding for α-conotoxins was isolated and cloned in accordance with conventional
techniques using general procedures well known in the art, such as described in Olivera et al. (1996).
Alternatively, cDNA libraries were prepared from Conus venom duct using conventional techniques.
DNA from single clones was amplified by conventional techniques using primers which correspond
approximately to the M13 universal priming site and the M13 reverse universal priming site. Clones
having a size of approximately 300 nucleotides were sequenced and screened for similarity in
sequence to known α-conotoxins. The DNA sequences and encoded propeptide or peptide
sequences are set forth in Tables 1-134.
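
The relationship between each DNA sequence and the encoded peptide sequence in the Tables is the standard genetic code, read in frame until the first stop codon. The following minimal sketch, offered only as an illustration, translates the DNA sequence of Table 11 (Bn1.2); the function name is hypothetical, and the input string is the Table 11 sequence with its coding portion and the first bases following the stop codon.

```python
from itertools import product

# Standard genetic code, codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate DNA in frame 1 (whitespace ignored) up to the first stop codon."""
    dna = "".join(dna.lower().split())
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the open reading frame
            break
        residues.append(aa)
    return "".join(residues)


bn1_2 = ("aaa gaa tgc tgt act cat cct gcc tgt cac gtg agt cat cca gaa ctc "
         "tgt ggt tgaaaagcga")
print(translate(bn1_2))  # KECCTHPACHVSHPELCG
```

The one-letter output corresponds to the three-letter protein line of Table 11 (Lys Glu Cys Cys Thr His Pro Ala Cys His Val Ser His Pro Glu Leu Cys Gly).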



TABLE 1 

DNA Sequence (SEQ ID NO:58) and Protein Sequence (SEQ ID NO:59) of MII

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc act gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc cct tca gat cgt gca tct gat ggc agg aat gcc gca gcc aac gac
Phe Pro Ser Asp Arg Ala Ser Asp Gly Arg Asn Ala Ala Ala Asn Asp

aaa gcg tct gac gtg atc acg ctg gcc ctc aag gga tgc tgt tcc aac
Lys Ala Ser Asp Val Ile Thr Leu Ala Leu Lys Gly Cys Cys Ser Asn

cct gtc tgt cac ttg gag cat tca aac ctt tgt ggt aga aga cgc tgatgctcca ggaccctctg aaccacgacg ttcgagca
Pro Val Cys His Leu Glu His Ser Asn Leu Cys Gly Arg Arg Arg



TABLE 2 

DNA Sequence (SEQ ID NO:60) and Protein Sequence (SEQ ID NO:61) of AuIA 





atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc acc gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc act tca gat cgt gca tct gat ggc agg aag gac gca gcg tct ggc
Phe Thr Ser Asp Arg Ala Ser Asp Gly Arg Lys Asp Ala Ala Ser Gly

ctg atc gct ctg acc atc aag gga tgc tgt tct tat cct ccc tgt ttc
Leu Ile Ala Leu Thr Ile Lys Gly Cys Cys Ser Tyr Pro Pro Cys Phe

gcg act aat tca gac tat tgt ggt tgacgacgct gatgctccag gaccctctga accacgacgt
Ala Thr Asn Ser Asp Tyr Cys Gly



TABLE 3 

DNA Sequence (SEQ ID NO:62) and Protein Sequence (SEQ ID NO:63) of AuIB

atg ttc acc gtg ttt ctg ttg gtc gtc ttg gca acc acc gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc act tca gat cgt gca tct gat ggc agg aag gac gca gcg tct ggc
Phe Thr Ser Asp Arg Ala Ser Asp Gly Arg Lys Asp Ala Ala Ser Gly

ctg att gct ctg acc atg aag gga tgc tgt tct tat cct ccc tgt ttc
Leu Ile Ala Leu Thr Met Lys Gly Cys Cys Ser Tyr Pro Pro Cys Phe

gcg act aat cca gac tgt ggt cga cga cgc tgatgctcca ggaccctctg aaccacgacg t
Ala Thr Asn Pro Asp Cys Gly Arg Arg Arg



TABLE 4

DNA Sequence (SEQ ID NO:64) and Protein Sequence (SEQ ID NO:65) of Tx1.3

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc acc gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc tct tca ggt cgt agt aca ttt cgt ggc agg aat gcc gca gcc aaa
Phe Ser Ser Gly Arg Ser Thr Phe Arg Gly Arg Asn Ala Ala Ala Lys

gcg tct ggc ctg gtc agt ctg act gac agg aga cca gaa tgc tgt agt
Ala Ser Gly Leu Val Ser Leu Thr Asp Arg Arg Pro Glu Cys Cys Ser

gat cct cgc tgt aac tcg agt cat cca gaa ctt tgt ggt gga aga cgc
Asp Pro Arg Cys Asn Ser Ser His Pro Glu Leu Cys Gly Gly Arg Arg

tgatgctcca ggaccctctg aaccacgacg t

TABLE 5

DNA Sequence (SEQ ID NO:66) and Protein Sequence (SEQ ID NO:67) of Tx1.2

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc gcc gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Ala Val Val Ser

ttc act tca gat cgt gca tct gat gac ggg aaa gcc gct gcg tct gac
Phe Thr Ser Asp Arg Ala Ser Asp Asp Gly Lys Ala Ala Ala Ser Asp

ctg atc act ctg acc atc aag gga tgc tgt tct cgt cct ccc tgt atc
Leu Ile Thr Leu Thr Ile Lys Gly Cys Cys Ser Arg Pro Pro Cys Ile

gcg aat aat cca gac ttg tgt ggt tgacgacgct gatgctccag aacggtctga accacgacgt tcgagcaatg ttcaccgtgt ttctgttggt tgtctt
Ala Asn Asn Pro Asp Leu Cys Gly

TABLE 6 

DNA Sequence (SEQ ID NO:68) and Protein Sequence (SEQ ID NO:69) of Tx1.1

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc acc gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc act tca ggt cgt agt aca ttt cgt ggc agg aat gcc gca gcc aaa
Phe Thr Ser Gly Arg Ser Thr Phe Arg Gly Arg Asn Ala Ala Ala Lys

gcg tct ggc ctg gtc agt ctg act gac agg aga cca caa tgc tgt tct
Ala Ser Gly Leu Val Ser Leu Thr Asp Arg Arg Pro Gln Cys Cys Ser

cat cct gcc tgt aac gta gat cat cca gaa att tgt cgt tgaagacgct gatgctccag gaccctctga accacgacgt
His Pro Ala Cys Asn Val Asp His Pro Glu Ile Cys Arg

TABLE 7 

DNA Sequence (SEQ ID NO:70) and Protein Sequence (SEQ ID NO:71) of R1.1A

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc acc gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc act tca ggt cgt cgt aca ttt cat ggc agg aat gcc gca gcc aaa
Phe Thr Ser Gly Arg Arg Thr Phe His Gly Arg Asn Ala Ala Ala Lys

gcg tct ggc ctg gtc agt ctg act gac agg aga cca gaa tgc tgt tct
Ala Ser Gly Leu Val Ser Leu Thr Asp Arg Arg Pro Glu Cys Cys Ser

cat cct gcc tgt aac gta gat cat cca gaa att tgt cgt tgaagacgct gatgctccag gaccctctga accacgacgt
His Pro Ala Cys Asn Val Asp His Pro Glu Ile Cys Arg

TABLE 8 

DNA Sequence (SEQ ID NO:72) and Protein Sequence (SEQ ID NO:73) of R1.1B

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc acc gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc act tca ggt cgt agt aca ttt cgt ggc agg aat gcc gca gcc aaa
Phe Thr Ser Gly Arg Ser Thr Phe Arg Gly Arg Asn Ala Ala Ala Lys

gcg tct ggc ctg gtc agt ctg act gac agg aga cca caa tgc tgt tct
Ala Ser Gly Leu Val Ser Leu Thr Asp Arg Arg Pro Gln Cys Cys Ser

cat cct gcc tgt aac gta gat cat cca gaa att tgc gat tgaagacgct gatgctccag gaccctctga accacgacgt
His Pro Ala Cys Asn Val Asp His Pro Glu Ile Cys Asp



TABLE 9 



DNA Sequence (SEQ ID NO:74) and Protein Sequence (SEQ ID NO:75) of S1.1

atg ttc act gtg ttt ctg ttg gtt gtc ttg gca atc act gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Ile Thr Val Val Ser

ttc cct tta gat cgt gaa tct gat ggc gcg aat gcc gaa gcc cgc acc
Phe Pro Leu Asp Arg Glu Ser Asp Gly Ala Asn Ala Glu Ala Arg Thr

cac gat cat gag aag cac gca ctg gac cgg aat gga tgc tgt agg aat
His Asp His Glu Lys His Ala Leu Asp Arg Asn Gly Cys Cys Arg Asn

cct gcc tgt gag agc cac aga tgt ggt tgacgacgct gatgctccag gaccctctga accacgacgt tcgagca
Pro Ala Cys Glu Ser His Arg Cys Gly



TABLE 10 

DNA Sequence (SEQ ID NO:76) and Protein Sequence (SEQ ID NO:77) of Bn1.1

atg ttc acc atg ttt ctg ttg gtt gtc ttg gca acc act gtc gtt tcc
Met Phe Thr Met Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc gct tca gat cgt gca tct gat ggc agg aat gcc gca gcc aag gac
Phe Ala Ser Asp Arg Ala Ser Asp Gly Arg Asn Ala Ala Ala Lys Asp

aaa gcg tct gac ctg gtc gct ctg acc gtc aag gga tgc tgt tct cat
Lys Ala Ser Asp Leu Val Ala Leu Thr Val Lys Gly Cys Cys Ser His

cct gcc tgt agc gtg aat aat cca gac att tgt ggt tgaagacgct gatgctccag gaccctctga accacgacgt tcgagca
Pro Ala Cys Ser Val Asn Asn Pro Asp Ile Cys Gly



TABLE 11

DNA Sequence (SEQ ID NO:78) and Protein Sequence (SEQ ID NO:79) of Bn1.2

aaa gaa tgc tgt act cat cct gcc tgt cac gtg agt cat cca gaa ctc
Lys Glu Cys Cys Thr His Pro Ala Cys His Val Ser His Pro Glu Leu

tgt ggt tgaaaagcga cgtgacgctc caggaccctc tgaaccacga cgttcgagca
Cys Gly

TABLE 12

DNA Sequence (SEQ ID NO:80) and Protein Sequence (SEQ ID NO:81) of Bn1.3

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca act gct gtt ctt cca
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Ala Val Leu Pro

gtc act tta gat cgt gca tct gat gga agg aat gca gca gcc aac gcc
Val Thr Leu Asp Arg Ala Ser Asp Gly Arg Asn Ala Ala Ala Asn Ala

aaa acg cct cgc ctg atc gcg cca ttc atc agg gat tat tgc tgt cat
Lys Thr Pro Arg Leu Ile Ala Pro Phe Ile Arg Asp Tyr Cys Cys His

aga ggt ccc tgt atg gta tgg tgt ggt tgaagccgct gctgctccag gaccctctga accac
Arg Gly Pro Cys Met Val Trp Cys Gly

TABLE 13 

DNA Sequence (SEQ ID NO:82) and Protein Sequence (SEQ ID NO:83) of Ca1.1

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc act gtg gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc act tca gat cgt gct tct gat ggc agg aat gcc gca gcc aac gcg
Phe Thr Ser Asp Arg Ala Ser Asp Gly Arg Asn Ala Ala Ala Asn Ala

ttt gac ctg atc gct ctg atc gcc agg caa aat tgc tgt agc att ccc
Phe Asp Leu Ile Ala Leu Ile Ala Arg Gln Asn Cys Cys Ser Ile Pro

agc tgt tgg gag aaa tat aaa tgt agt taa
Ser Cys Trp Glu Lys Tyr Lys Cys Ser

















TABLE 14 

DNA Sequence (SEQ ID NO:84) and Protein Sequence (SEQ ID NO:85) of Ca1.2

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc act gtg gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc act tca gat cgt gcg tct gaa ggc agg aat gct gca gcc aag gac
Phe Thr Ser Asp Arg Ala Ser Glu Gly Arg Asn Ala Ala Ala Lys Asp

aaa gcg tct gac ctg gtg gct ctg aca gtc agg gga tgc tgt gcc att
Lys Ala Ser Asp Leu Val Ala Leu Thr Val Arg Gly Cys Cys Ala Ile

cgt gaa tgt cgc ttg cag aat gca gcg tat tgt ggt gga ata tac tgatgctcca ggaccctctg aaccacgacg
Arg Glu Cys Arg Leu Gln Asn Ala Ala Tyr Cys Gly Gly Ile Tyr

TABLE 15 

DNA Sequence (SEQ ID NO:86) and Protein Sequence (SEQ ID NO:87) of TIB

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc act gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc cct tca gat att gca act gag ggc agg aat gcc gca gcc aaa gcg
Phe Pro Ser Asp Ile Ala Thr Glu Gly Arg Asn Ala Ala Ala Lys Ala

ttt gac ctg ata tct tcg atc gtc aag aaa gga tgc tgt tcc cat cct
Phe Asp Leu Ile Ser Ser Ile Val Lys Lys Gly Cys Cys Ser His Pro

gcc tgt tcg ggg aat aat cca gaa ttt tgt cgt caa ggt cgc tgatgctcca ggaccctctg aaccacgacg t
Ala Cys Ser Gly Asn Asn Pro Glu Phe Cys Arg Gln Gly Arg



TABLE 16 

DNA Sequence (SEQ ID NO:88) and Protein Sequence (SEQ ID NO:89) of TIA 



atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc act gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc cct tca gat ata gca act gag ggc agg aat gcc gca gcc aaa gcg
Phe Pro Ser Asp Ile Ala Thr Glu Gly Arg Asn Ala Ala Ala Lys Ala

ttt gac ctg ata tct tcg atc gtc agg aaa gga tgc tgt tcc aat ccc
Phe Asp Leu Ile Ser Ser Ile Val Arg Lys Gly Cys Cys Ser Asn Pro

gcc tgt gcg ggg aat aat cca cat gtt tgt cgt caa ggt cgc tgatgctcca ggaccctctg aaccacgacg t
Ala Cys Ala Gly Asn Asn Pro His Val Cys Arg Gln Gly Arg



TABLE 17 

DNA Sequence (SEQ ID NO:90) and Protein Sequence (SEQ ID NO:91) of Sl1.1

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc acc gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc aat tca gat cgt gat cca gca tta ggt ggc agg aat gct gca gcc
Phe Asn Ser Asp Arg Asp Pro Ala Leu Gly Gly Arg Asn Ala Ala Ala

aaa gcg tct gac aag atc gct tcg acc ctc aag aga aga gga tgc tgt
Lys Ala Ser Asp Lys Ile Ala Ser Thr Leu Lys Arg Arg Gly Cys Cys

tcg tat ttt gac tgt aga atg atg ttt cca gaa atg tgt ggt tgg cga
Ser Tyr Phe Asp Cys Arg Met Met Phe Pro Glu Met Cys Gly Trp Arg

ggc tgatgctcca ggaccctctg aaccacgacg t
Gly



TABLE 18 

DNA Sequence (SEQ ID NO:92) and Protein Sequence (SEQ ID NO:93) of Sl1.2

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc acc gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc aat tca gat cgt gat cca gca tta ggt ggc agg aat gct gca gcc
Phe Asn Ser Asp Arg Asp Pro Ala Leu Gly Gly Arg Asn Ala Ala Ala

ata gcg tct gac aag atc gct tcg acc ctc agg aga gga gga tgc tgt
Ile Ala Ser Asp Lys Ile Ala Ser Thr Leu Arg Arg Gly Gly Cys Cys

tct ttt cct gcc tgt aga aag tat cgt cca gaa atg tgt ggt gga cga
Ser Phe Pro Ala Cys Arg Lys Tyr Arg Pro Glu Met Cys Gly Gly Arg

cgc tgatgctcca ggaccctctg aaccacgacg t
Arg



TABLE 19 

DNA Sequence (SEQ ID NO:94) and Protein Sequence (SEQ ID NO:95) of Sl1.3

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc acc gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc act tca gat cat gaa tct gat cgc ggt gat gcc caa acc atc caa
Phe Thr Ser Asp His Glu Ser Asp Arg Gly Asp Ala Gln Thr Ile Gln

gaa gtg ttt gag atg ttc gct ctg gac agc gat gga tgc tgt tgg cat
Glu Val Phe Glu Met Phe Ala Leu Asp Ser Asp Gly Cys Cys Trp His

cct gct tgt ggc aga cac tat tgt ggt cga aga cgc tgatgctcca ggaccctctg aaccacgacg t
Pro Ala Cys Gly Arg His Tyr Cys Gly Arg Arg Arg



TABLE 20 

DNA Sequence (SEQ ID NO:96) and Protein Sequence (SEQ ID NO:97) of Sl1.6

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc acc gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc aat tca gat cgt gat cca gca tta ggt ggc agg aat gct gca gcc
Phe Asn Ser Asp Arg Asp Pro Ala Leu Gly Gly Arg Asn Ala Ala Ala

ata gcg tct gac aag atc gct tcg acc ctc agg aga gga gga tgc tgt
Ile Ala Ser Asp Lys Ile Ala Ser Thr Leu Arg Arg Gly Gly Cys Cys

tct ttt gct gcc tgt aga aag tat cgt cca gaa atg tgt ggt gga cga
Ser Phe Ala Ala Cys Arg Lys Tyr Arg Pro Glu Met Cys Gly Gly Arg

cgc tgatgct
Arg



TABLE 21 

DNA Sequence (SEQ ID NO:98) and Protein Sequence (SEQ ID NO:99) of Sl1.7

atg ttc acc gtg ttt ctg ttg gtt ctc ttg gca acc acc gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Leu Leu Ala Thr Thr Val Val Ser

ttc aat tca gat cgt gca tta ggt ggc agg aat gct gca gcc aaa gcg
Phe Asn Ser Asp Arg Ala Leu Gly Gly Arg Asn Ala Ala Ala Lys Ala

tct gac aag atc ctt tcg aac ctc agg aga gga gga tgc tgt ttt cat
Ser Asp Lys Ile Leu Ser Asn Leu Arg Arg Gly Gly Cys Cys Phe His

cct gtc tgt tac atc aat ctt cta gaa atg tgt cgt caa cga ggc tgatcgtcca ggaccctctg aaccacgacg t
Pro Val Cys Tyr Ile Asn Leu Leu Glu Met Cys Arg Gln Arg Gly

TABLE 22

DNA Sequence (SEQ ID NO:100) and Protein Sequence (SEQ ID NO:101) of Cn1.1

atg ttc acc gtg ttt ctg ttg gtt gtc ttg aca acc act gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Thr Thr Thr Val Val Ser

ttc cct tca gat agt gca tct gat gtc agg gat gac gaa gcc aaa gac
Phe Pro Ser Asp Ser Ala Ser Asp Val Arg Asp Asp Glu Ala Lys Asp

gaa agg tct gac atg tac aaa tcg aaa cgg aat gga cgc tgt tgc cat
Glu Arg Ser Asp Met Tyr Lys Ser Lys Arg Asn Gly Arg Cys Cys His

cct gcc tgt ggc aaa cac ttt agt tgt gga cgc tgatgctcca ggaccctctg aaccacgacg t
Pro Ala Cys Gly Lys His Phe Ser Cys Gly Arg

TABLE 23 

DNA Sequence (SEQ ID NO:102) and Protein Sequence (SEQ ID NO:103) of SmI

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc act gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

tcc cct tca gat cgt gca tct gat ggc agg aat gcc gca gcc aac gag
Ser Pro Ser Asp Arg Ala Ser Asp Gly Arg Asn Ala Ala Ala Asn Glu

aaa gcg tct gac gtg atc gcg ctg gcc ctc aag gga tgc tgt tcc aac
Lys Ala Ser Asp Val Ile Ala Leu Ala Leu Lys Gly Cys Cys Ser Asn

cct gtc tgt cac ctg gag cat tca aac atg tgt ggt aga aga cgc tgatgctcca ggaccctctg aaccacgacg
Pro Val Cys His Leu Glu His Ser Asn Met Cys Gly Arg Arg Arg

TABLE 24 

DNA Sequence (SEQ ID NO:104) and Protein Sequence (SEQ ID NO:105) of Bt1.1

atg ttc tcc gtg ttt ctg ttg gtt gtc ttg gca acc act gtc gtt tcc
Met Phe Ser Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

tcc act tca ggt ggt gca tct ggt ggc agg aag gct gca gcc aaa gcg
Ser Thr Ser Gly Gly Ala Ser Gly Gly Arg Lys Ala Ala Ala Lys Ala

tct aac cgg atc gct ctg acc gtc agg agt gca aca tgc tgt aat tat
Ser Asn Arg Ile Ala Leu Thr Val Arg Ser Ala Thr Cys Cys Asn Tyr

cct ccc tgt tac gag act tat cca gaa agt tgt ctg taacgtgaat catccagagc tttgtggctg aagacactga tgctccagga ccctctgaac cacgacgt
Pro Pro Cys Tyr Glu Thr Tyr Pro Glu Ser Cys Leu

TABLE 25 

DNA Sequence (SEQ ID NO:106) and Protein Sequence (SEQ ID NO:107) of Bt1.2

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc act gtg gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc act tca ggt cgt gca ttt cgt ggc agg aat cgc gca gcc gac gac
Phe Thr Ser Gly Arg Ala Phe Arg Gly Arg Asn Arg Ala Ala Asp Asp

aaa agg tct gac ctg gcc gct ctg agc gtc agg gga gga tgc tgt tcc
Lys Arg Ser Asp Leu Ala Ala Leu Ser Val Arg Gly Gly Cys Cys Ser

cat cct gcc tgt gcg gtg aat cat cca gag ctt tgt ggc tgaagacgct gatgccccag gaccctctga accacgacgt
His Pro Ala Cys Ala Val Asn His Pro Glu Leu Cys Gly



TABLE 26 

DNA Sequence (SEQ ID NO:108) and Protein Sequence (SEQ ID NO:109) of Bt1.3

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc act gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc act tca ggt cgt gca tct ggt ggc agg aat gct gca gcc aaa gcg
Phe Thr Ser Gly Arg Ala Ser Gly Gly Arg Asn Ala Ala Ala Lys Ala

tct aac cgg atc gct atg gcc atc agc agt gga gca tgc tgt gca tat
Ser Asn Arg Ile Ala Met Ala Ile Ser Ser Gly Ala Cys Cys Ala Tyr

cct ccc tgt ttc gag gct tat cca gaa aga tgt ctg taacgtgaat
Pro Pro Cys Phe Glu Ala Tyr Pro Glu Arg Cys Leu

catccagacc tttgtggctg aagacgctga tgccccagga ccctctgaac cacgacgt



TABLE 27 

DNA Sequence (SEQ ID NO:110) and Protein Sequence (SEQ ID NO:111) of Bt1.4

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc act gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc act tca gat cgt gca ttt cgt ggc agg aat tcc gca gcc aac gac
Phe Thr Ser Asp Arg Ala Phe Arg Gly Arg Asn Ser Ala Ala Asn Asp

aaa agg tct gac ctg gcc gct ctg agc gtc agg aga gga tgc tgc tcc
Lys Arg Ser Asp Leu Ala Ala Leu Ser Val Arg Arg Gly Cys Cys Ser

cat ccc gcc tgt agc gtg aat cat cca gag ctt tgt ggt aga aga cgc
His Pro Ala Cys Ser Val Asn His Pro Glu Leu Cys Gly Arg Arg Arg

tgatgcccca ggaccctctg aaccacgacg t



TABLE 28 

DNA Sequence (SEQ ID NO:112) and Protein Sequence (SEQ ID NO:113) of Bt1.5

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc act gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc act tca ggt cgt gca tct ggt ggc agg aat gct gca gcc aaa gcg
Phe Thr Ser Gly Arg Ala Ser Gly Gly Arg Asn Ala Ala Ala Lys Ala

tct aac cgg atc gct ctg atc gtc agg aat gca gaa tgc tgt tat tat
Ser Asn Arg Ile Ala Leu Ile Val Arg Asn Ala Glu Cys Cys Tyr Tyr

cct ccc tgt tac gag gct tat cca gaa att tgt ctg taacgtgaat
Pro Pro Cys Tyr Glu Ala Tyr Pro Glu Ile Cys Leu

catccagacc tttgtggctg aagaccctga tgctccagga ccctctgaac cacgacgt



TABLE 29 

DNA Sequence (SEQ ID NO:114) and Protein Sequence (SEQ ID NO:115) of Pn1.1

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc acc gtc att tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Ile Ser

ttc act tca gat cgt gca tct gat ggc ggg aat gcc gca gcg tct gac
Phe Thr Ser Asp Arg Ala Ser Asp Gly Gly Asn Ala Ala Ala Ser Asp

ctg atc gct ctg acc atc aag gga tgc tgt tct cat cct ccc tgt gcc
Leu Ile Ala Leu Thr Ile Lys Gly Cys Cys Ser His Pro Pro Cys Ala

atg aat aat cca gac tat tgt ggt tgacgacgct gatgctccag gaccctctga
Met Asn Asn Pro Asp Tyr Cys Gly

accacgacg



TABLE 30 

DNA Sequence (SEQ ID NO:116) and Protein Sequence (SEQ ID NO:117) of Pn1.2

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc acc gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc act tca gat cgt gca tct gat ggc ggg aat gcc gca atg tct gac
Phe Thr Ser Asp Arg Ala Ser Asp Gly Gly Asn Ala Ala Met Ser Asp

ctg atc gct ctg acc atc aag gga tgc tgt tct cat cct ccc tgt ttc
Leu Ile Ala Leu Thr Ile Lys Gly Cys Cys Ser His Pro Pro Cys Phe

ctg aat aat cca gac tat tgt ggt tgacgacgct gatgctccag gaccctctga
Leu Asn Asn Pro Asp Tyr Cys Gly

accacgacg



TABLE 31 

DNA Sequence (SEQ ID NO:118) and Protein Sequence (SEQ ID NO:119) of Sm1.3

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc act gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc cct tca gat cgt gaa tct gat ggc gcg aat gac gaa gcc cgc acc
Phe Pro Ser Asp Arg Glu Ser Asp Gly Ala Asn Asp Glu Ala Arg Thr

gac gag cct gag gag cac gga ccg gac agg aat gga tgc tgt agg aat
Asp Glu Pro Glu Glu His Gly Pro Asp Arg Asn Gly Cys Cys Arg Asn

cct gcc tgt gag agc cac aga tgt ggt tgacgacgct gatgctccag
Pro Ala Cys Glu Ser His Arg Cys Gly

gaccctctga accacgacg

TABLE 32

DNA Sequence (SEQ ID NO:120) and Protein Sequence (SEQ ID NO:121) of Cr1.2

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc act gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc cct tca gat cgt gca tct gat ggc agg aat gcc gca gcc agc gac
Phe Pro Ser Asp Arg Ala Ser Asp Gly Arg Asn Ala Ala Ala Ser Asp

aga gcg tct gac gcg gcc cac cag gga tgc tgt tcc aac cct gtc tgt
Arg Ala Ser Asp Ala Ala His Gln Gly Cys Cys Ser Asn Pro Val Cys

cac gtg gaa cat cca gaa ctt tgt cgt aga aga cgc tgatgctcca
His Val Glu His Pro Glu Leu Cys Arg Arg Arg Arg

ggaccctctg aaccacgacg



TABLE 33 

DNA Sequence (SEQ ID NO:122) and Protein Sequence (SEQ ID NO:123) of Cr1.3

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc act gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc cct tca aat cgt gaa tct gat ggc gcg aat gcc gaa gtc cgc acc
Phe Pro Ser Asn Arg Glu Ser Asp Gly Ala Asn Ala Glu Val Arg Thr

gac gag cct gag gag cac gac gaa ctg ggc ggg aat gga tgc tgt ggg
Asp Glu Pro Glu Glu His Asp Glu Leu Gly Gly Asn Gly Cys Cys Gly

aat cct gac tgt acg agc cac agt tgt gat tgacgacgct gatgctccag
Asn Pro Asp Cys Thr Ser His Ser Cys Asp

gaccctctga accacgacg



TABLE 34 

DNA Sequence (SEQ ID NO:124) and Protein Sequence (SEQ ID NO:125) of EpI

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc acc gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc act tca gat cgt gca tct gat agc agg aag gac gca gcg tct ggc
Phe Thr Ser Asp Arg Ala Ser Asp Ser Arg Lys Asp Ala Ala Ser Gly

ctg atc gct ctg acc atc aag gga tgc tgt tct gat cct cgc tgt aac
Leu Ile Ala Leu Thr Ile Lys Gly Cys Cys Ser Asp Pro Arg Cys Asn

atg aat aat cca gac tat tgt ggt tgacgacgct gatgctccag gaccctctga
Met Asn Asn Pro Asp Tyr Cys Gly

accacgacg



TABLE 35 

DNA Sequence (SEQ ID NO:126) and Protein Sequence (SEQ ID NO:127) of Sn1.1

atg tcc acc gtg ttt ctg ttg gtt gtc ctc gca acc acc gtc gtt tcc
Met Ser Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc act gta gat cgt gca tct gat ggc agg gat gtc gca atc gac gac
Phe Thr Val Asp Arg Ala Ser Asp Gly Arg Asp Val Ala Ile Asp Asp

aga ttg gtg tct ctc cct cag atc gcc cat gct gac tgt tgt tcc gat
Arg Leu Val Ser Leu Pro Gln Ile Ala His Ala Asp Cys Cys Ser Asp

cct gcc tgc aag cag acg ccc ggt tgt cgt taaagacgct gctgctccag
Pro Ala Cys Lys Gln Thr Pro Gly Cys Arg

gaccctctga accacgacg



TABLE 36 

DNA Sequence (SEQ ID NO:128) and Protein Sequence (SEQ ID NO:129) of Sn1.2

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc acc gtc gct tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Ala Ser

ttc att atc gat gat cca tct gat ggc agg aat att gca gtc gac gac
Phe Ile Ile Asp Asp Pro Ser Asp Gly Arg Asn Ile Ala Val Asp Asp

aga ggg ctt ttc tct acg ctc ttc cat gct gat tgc tgt gaa aat cct
Arg Gly Leu Phe Ser Thr Leu Phe His Ala Asp Cys Cys Glu Asn Pro

gcc tgt aga cac acg cag ggt tgt tgatctttgt tcttcaaaga cactgctggc
Ala Cys Arg His Thr Gln Gly Cys

ccaggaccct ctgaaccacg acg



TABLE 37 

DNA Sequence (SEQ ID NO:130) and Protein Sequence (SEQ ID NO:131) of Da1.1

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc acc gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc act tca gat cgt gca ttt cgt ggc agg aat gcc gca gcc aaa gag
Phe Thr Ser Asp Arg Ala Phe Arg Gly Arg Asn Ala Ala Ala Lys Glu

tct ggc ctg gtc ggt ctg acc gac aag acg cga gga tgc tgt tct cat
Ser Gly Leu Val Gly Leu Thr Asp Lys Thr Arg Gly Cys Cys Ser His

cct gcc tgt aac gta gat cat cca gaa att tgt ggt tgaagacgct
Pro Ala Cys Asn Val Asp His Pro Glu Ile Cys Gly

gatgctccag gaccctctga accacgacgt



TABLE 38 

DNA Sequence (SEQ ID NO:132) and Protein Sequence (SEQ ID NO:133) of Da1.2

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc acc gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc act tca gat ggt gca tct gat gac agg aaa gcc gct gcg tct gac
Phe Thr Ser Asp Gly Ala Ser Asp Asp Arg Lys Ala Ala Ala Ser Asp

ctg atc act ctg acc atc aag gga tgc tgt tct cgt cct ccc tgt atc
Leu Ile Thr Leu Thr Ile Lys Gly Cys Cys Ser Arg Pro Pro Cys Ile

gcg aat aat cca gac ttg tgt ggt cga cga cgc tgatgctcca ggaccctctg
Ala Asn Asn Pro Asp Leu Cys Gly Arg Arg Arg



TABLE 39 

DNA Sequence (SEQ ID NO:134) and Protein Sequence (SEQ ID NO:135) of Da1.3

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc act gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

tcc act tca ggt cgt cgt gca ttt cat ggc agg aat gcc gca gcc aaa
Ser Thr Ser Gly Arg Arg Ala Phe His Gly Arg Asn Ala Ala Ala Lys

gcg tct gga ctg gtc ggt ctg act gac agg aga cca caa tgc tgt agt
Ala Ser Gly Leu Val Gly Leu Thr Asp Arg Arg Pro Gln Cys Cys Ser

gat cct cgc tgt aac gta ggt cat cca gaa ctt tgt ggt gga aga cgc
Asp Pro Arg Cys Asn Val Gly His Pro Glu Leu Cys Gly Gly Arg Arg

tgatgctcca ggaccctctg aaccacaacg t

TABLE 40

DNA Sequence (SEQ ID NO:136) and Protein Sequence (SEQ ID NO:137) of Da1.4

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc act gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

tcc act tca ggt cgt gca ttt cat ggc agg aat gcc gca gcc aaa gcg
Ser Thr Ser Gly Arg Ala Phe His Gly Arg Asn Ala Ala Ala Lys Ala

tct ggc ctg gtc ggt ctg acc gac aag agg caa gta tgc tgt agt gat
Ser Gly Leu Val Gly Leu Thr Asp Lys Arg Gln Val Cys Cys Ser Asp

cct cgc tgt aac gta ggt cat cca gaa att tgt ggt gga aga cgc
Pro Arg Cys Asn Val Gly His Pro Glu Ile Cys Gly Gly Arg Arg

tgatgctcca ggaccctctg aaccacgacg t



TABLE 41

DNA Sequence (SEQ ID NO:138) and Protein Sequence (SEQ ID NO:139) of A1.2

atg ttc acc gtg ttt ctg ttg gtt gtc ttg aca acc act gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Thr Thr Thr Val Val Ser

ttc cct tca gat agt gca tct ggt ggc agg gat gac gag gcc aaa gac
Phe Pro Ser Asp Ser Ala Ser Gly Gly Arg Asp Asp Glu Ala Lys Asp

gaa agg tct gac atg tac gaa ttg aaa cgg aat gga cgc tgt tgc cat
Glu Arg Ser Asp Met Tyr Glu Leu Lys Arg Asn Gly Arg Cys Cys His

cct gcc tgt ggt ggc aaa tac gtt aaa tgt gga cgc tgatgctcca
Pro Ala Cys Gly Gly Lys Tyr Val Lys Cys Gly Arg

ggaccctcte gaaccacg

TABLE 42

DNA Sequence (SEQ ID NO:140) and Protein Sequence (SEQ ID NO:141) of Bu1.1

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc act gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc tct aca gat gat gaa tct gat ggc tcg aat gaa gaa ccc agc gcc
Phe Ser Thr Asp Asp Glu Ser Asp Gly Ser Asn Glu Glu Pro Ser Ala

gac cag act gcc agg tcc tca atg aac agg gcg cct gga tgc tgt aac
Asp Gln Thr Ala Arg Ser Ser Met Asn Arg Ala Pro Gly Cys Cys Asn

aat cct gcc tgt gtg aag cac aga tgt gga tgacgctgat gctccaggac
Asn Pro Ala Cys Val Lys His Arg Cys Gly

cctctgaacc acgacgt

TABLE 43 

DNA Sequence (SEQ ID NO:142) and Protein Sequence (SEQ ID NO:143) of Bu1.2

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc act gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc tct aca gat gat gaa tct gat ggc tcg aat gaa gaa ccc agc gcc
Phe Ser Thr Asp Asp Glu Ser Asp Gly Ser Asn Glu Glu Pro Ser Ala

gac cag gct gcc agg tcc gca atg aac agg ccg cct gga tgc tgt aac
Asp Gln Ala Ala Arg Ser Ala Met Asn Arg Pro Pro Gly Cys Cys Asn

aat cct gcc tgt gtg aag cac aga tgt ggt gga tgacgctgat gctccaggac
Asn Pro Ala Cys Val Lys His Arg Cys Gly Gly

cctctgaacc acgacgt

TABLE 44 

DNA Sequence (SEQ ID NO:144) and Protein Sequence (SEQ ID NO:145) of Bu1.3

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc act gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc cct tca gat cgt gac tct gat ggc gcg gat gcc gaa gcc agt gac
Phe Pro Ser Asp Arg Asp Ser Asp Gly Ala Asp Ala Glu Ala Ser Asp

gag cct gtt gag ttc gaa agg gac gag aat gga tgc tgt tgg aat cct
Glu Pro Val Glu Phe Glu Arg Asp Glu Asn Gly Cys Cys Trp Asn Pro

tcc tgt ccg agg ccc aga tgt aca gga cga cgc taatgctcca ggaccctctg
Ser Cys Pro Arg Pro Arg Cys Thr Gly Arg Arg

aaccacgacg t

TABLE 45 

DNA Sequence (SEQ ID NO:146) and Protein Sequence (SEQ ID NO:170) of Bu1.4

atg ttc acc gtg ttt ctg ttg gtt gtc ttg aca acc act gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Thr Thr Thr Val Val Ser

ttc cct tca gat cgt gca tct gat ggc agg aat gcc gca gcc aac gac
Phe Pro Ser Asp Arg Ala Ser Asp Gly Arg Asn Ala Ala Ala Asn Asp

aaa gcg tct gac gtg gtc acg ctg gtc ctc aag gga tgc tgt tcc acc
Lys Ala Ser Asp Val Val Thr Leu Val Leu Lys Gly Cys Cys Ser Thr

cct ccc tgt gct gtg ctg tat tgt ggt aga aga cgc tgatgctcca
Pro Pro Cys Ala Val Leu Tyr Cys Gly Arg Arg Arg

ggaccctctg aaccacgacg t



TABLE 46 

DNA Sequence (SEQ ID NO:148) and Protein Sequence (SEQ ID NO:149) of Di1.1

atg ttc acc gtg ttt ctg ttg gtt gtc ttc gca tcc tct gtc acc tta
Met Phe Thr Val Phe Leu Leu Val Val Phe Ala Ser Ser Val Thr Leu

gat cgt gca tct tat ggc agg tat gcc tca ccc gtc gac aga gcg tct
Asp Arg Ala Ser Tyr Gly Arg Tyr Ala Ser Pro Val Asp Arg Ala Ser

gcc ctg atc gct cag gcc atc ctt cga gat tgc tgc tcc aat cct cct
Ala Leu Ile Ala Gln Ala Ile Leu Arg Asp Cys Cys Ser Asn Pro Pro

tgt gcc cat aat aat cca gac tgt cgt taaagacgct gcttgctcca
Cys Ala His Asn Asn Pro Asp Cys Arg

ggaccctctg aaccacgacg t



TABLE 47 

DNA Sequence (SEQ ID NO:150) and Protein Sequence (SEQ ID NO:151) of Tl

gga tgc tgt tct aat cct ccc tgt atc gcg aag aat cca cac atg tgt
Gly Cys Cys Ser Asn Pro Pro Cys Ile Ala Lys Asn Pro His Met Cys

ggt gga aga cgc tga
Gly Gly Arg Arg



TABLE 48

DNA Sequence (SEQ ID NO:152) and Protein Sequence (SEQ ID NO:153) of Cn1.2

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc act gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc cct tca gat cgt gca tct gat ggc agg aat gcc gca gcc aac gac
Phe Pro Ser Asp Arg Ala Ser Asp Gly Arg Asn Ala Ala Ala Asn Asp

aaa gcg tct gac gtg atc acg ctg gcc ctc aag gga tgc tgt tcc aac
Lys Ala Ser Asp Val Ile Thr Leu Ala Leu Lys Gly Cys Cys Ser Asn

cct gtc tgt cac ttg gag cat tca aac ctt tgt ggt aga aga cgc
Pro Val Cys His Leu Glu His Ser Asn Leu Cys Gly Arg Arg Arg

tgatgctcca ggaccctctg aaccacgacg t

TABLE 49

DNA Sequence (SEQ ID NO:233) and Protein Sequence (SEQ ID NO:234) of Im1.1

tct gat gga aag agt gcc gcg gcc aaa gcc aaa ccg tct cac ctg acg
Ser Asp Gly Lys Ser Ala Ala Ala Lys Ala Lys Pro Ser His Leu Thr

gct cca ttc atc agg gac gaa tgc tgt tcc gat tct cgc tgt ggc aag
Ala Pro Phe Ile Arg Asp Glu Cys Cys Ser Asp Ser Arg Cys Gly Lys

aac tgt ctt tga
Asn Cys Leu



TABLE 50 

DNA Sequence (SEQ ID NO:235) and Protein Sequence (SEQ ID NO:236) of Im1.2

ttt gat gga agg aat gcc cca gcc gac gac aaa gcg tct gac ctg atc
Phe Asp Gly Arg Asn Ala Pro Ala Asp Asp Lys Ala Ser Asp Leu Ile

gct caa atc gtc agg aga gca tgc tgt tcc gat cgt cgc tgt aga tgg
Ala Gln Ile Val Arg Arg Ala Cys Cys Ser Asp Arg Arg Cys Arg Trp

agg tgt ggt tga
Arg Cys Gly



TABLE 51 

DNA Sequence (SEQ ID NO:237) and Protein Sequence (SEQ ID NO:238) of Rg1.2

tct gat gga agg aat gcc gca gcc gac gcc aga gcg tct ccc cgg atc
Ser Asp Gly Arg Asn Ala Ala Ala Asp Ala Arg Ala Ser Pro Arg Ile

gct ctt ttc ctc agg ttc aca tgc tgt agg aga ggt acc tgt tcc cag
Ala Leu Phe Leu Arg Phe Thr Cys Cys Arg Arg Gly Thr Cys Ser Gln

cac tgt ggt tgaagacact gctgctccag gaccctctga accacgacgt
His Cys Gly



TABLE 52 

DNA Sequence (SEQ ID NO:239) and Protein Sequence (SEQ ID NO:240) of Rg1.6

tct aat gga agg aat gcc gca gcc gac gcc aaa gcg tct caa cgg atc
Ser Asn Gly Arg Asn Ala Ala Ala Asp Ala Lys Ala Ser Gln Arg Ile

gct cca ttc ctc agg gac tat tgc tgt agg aga cat gcc tgt acg ttg
Ala Pro Phe Leu Arg Asp Tyr Cys Cys Arg Arg His Ala Cys Thr Leu

att tgt ggt tgaagacgct gctgctccag gaccctctga accacgacgt
Ile Cys Gly



TABLE 53 

DNA Sequence (SEQ ID NO:241) and Protein Sequence (SEQ ID NO:242) of Rg1.6A

tct aat gga agg aat gcc gca gcc gac gcc aaa gcg tct caa cgg atc
Ser Asn Gly Arg Asn Ala Ala Ala Asp Ala Lys Ala Ser Gln Arg Ile

gct cca ttc ctc agg gac tat tgc tgt agg aga cct ccc tgt acg ttg
Ala Pro Phe Leu Arg Asp Tyr Cys Cys Arg Arg Pro Pro Cys Thr Leu

att tgt ggt tgaagacgct gctgctccag gaccctctga accacgacgt
Ile Cys Gly

TABLE 54 

DNA Sequence (SEQ ID NO:243) and Protein Sequence (SEQ ID NO:244) of Rg1.7

tct aat aaa agg aag aat gcc gca atg ctt gac atg atc gct caa cac
Ser Asn Lys Arg Lys Asn Ala Ala Met Leu Asp Met Ile Ala Gln His

gcc ata agg ggt tgc tgt tcc gat cct cgc tgt aga tat aga tgt cgt
Ala Ile Arg Gly Cys Cys Ser Asp Pro Arg Cys Arg Tyr Arg Cys Arg

tgaagacgct gctgctccag gaccctctga accacgacgt

TABLE 55 

DNA Sequence (SEQ ID NO:245) and Protein Sequence (SEQ ID NO:246) of Rg1.9

ttt aat gga agg agt gcc gca gcc gac caa aat gcg cct ggc ctg atc
Phe Asn Gly Arg Ser Ala Ala Ala Asp Gln Asn Ala Pro Gly Leu Ile

gct caa gtc gtc aga gga ggg tgc tgt tcc gat ccc cgc tgc gcc tgg
Ala Gln Val Val Arg Gly Gly Cys Cys Ser Asp Pro Arg Cys Ala Trp

aga tgt ggt tgaagacgtt gctgctccag gaccctctga accacgacgt
Arg Cys Gly



TABLE 56 

DNA Sequence (SEQ ID NO:247) and Protein Sequence (SEQ ID NO:248) of Rg1.10

ttt gat gga agg aat gcc gca gcc gac gcc aaa gtg att aac acg gtc
Phe Asp Gly Arg Asn Ala Ala Ala Asp Ala Lys Val Ile Asn Thr Val

gct cga atc gcc tgg gat ata tgc tgt tcc gaa cct gac tgt aac cat
Ala Arg Ile Ala Trp Asp Ile Cys Cys Ser Glu Pro Asp Cys Asn His

aaa tgt gtt tgaagacgct tctgctccag gaccctctga accacgacgt
Lys Cys Val



TABLE 57

DNA Sequence (SEQ ID NO:249) and Protein Sequence (SEQ ID NO:250) of Rg1.11

tct aat aaa agg aag aat gcc gca atg ctt gac atg atc gct caa cac
Ser Asn Lys Arg Lys Asn Ala Ala Met Leu Asp Met Ile Ala Gln His

gcc ata agg ggt tgc tgt tcc gat cct cgc tgt aaa cat cag tgt ggt
Ala Ile Arg Gly Cys Cys Ser Asp Pro Arg Cys Lys His Gln Cys Gly

tgaagacgct gctgctccag gaccctctga accacgacgt



TABLE 58

DNA Sequence (SEQ ID NO:251) and Protein Sequence (SEQ ID NO:252) of Ms1.7

atc aag aat aca gca gcc agc aac aaa gcg tct agc ctg gtg gct ctt
Ile Lys Asn Thr Ala Ala Ser Asn Lys Ala Ser Ser Leu Val Ala Leu

gtt gtc agg gga tgc tgt tac aat cct gtc tgc aag aaa tat tat tgt
Val Val Arg Gly Cys Cys Tyr Asn Pro Val Cys Lys Lys Tyr Tyr Cys

tgg aaa ggc tgatgctcca ggaccctctg aaccacgacg t
Trp Lys Gly
TABLE 59

DNA Sequence (SEQ ID NO:253) and Protein Sequence (SEQ ID NO:254) of P1.7

tct gaa ggc agg aat gct gaa gcc atc gac aac gcc tta gac cag agg
Ser Glu Gly Arg Asn Ala Glu Ala Ile Asp Asn Ala Leu Asp Gln Arg

gat cca aag cga cag gag ccg ggg tgc tgt agg cat cct gcc tgt ggg
Asp Pro Lys Arg Gln Glu Pro Gly Cys Cys Arg His Pro Ala Cys Gly

aag aac aga tgt gga aga cgc tgatgctcca ggaccctctg aaccacgacg t
Lys Asn Arg Cys Gly Arg Arg








TABLE 60

DNA Sequence (SEQ ID NO:255) and Protein Sequence (SEQ ID NO:256) of Ms1.2

tct gat ggc agg aat att gca gtc gac gac aga tgg tct ttc tat acg
Ser Asp Gly Arg Asn Ile Ala Val Asp Asp Arg Trp Ser Phe Tyr Thr

ctc ttc cat gct act tgc tgt gcc gat cct gac tgt aga ttc cgg ccc
Leu Phe His Ala Thr Cys Cys Ala Asp Pro Asp Cys Arg Phe Arg Pro

ggt tgt tgatctttgt tcttcaaaga cgctgctggc ccaggaccct ctgaaccacg
Gly Cys

acgt














TABLE 61

DNA Sequence (SEQ ID NO:257) and Protein Sequence (SEQ ID NO:258) of Ms1.3

atc aag aat act gca gcc agc aac aaa gcg cct agc ctg gtg gct att
Ile Lys Asn Thr Ala Ala Ser Asn Lys Ala Pro Ser Leu Val Ala Ile

gcc gtc agg gga tgc tgt tac aat cct tcc tgt tgg ccg aaa aca tat
Ala Val Arg Gly Cys Cys Tyr Asn Pro Ser Cys Trp Pro Lys Thr Tyr

tgt agt tggaaaggct gatgctccag gaccctctga accacgacgt
Cys Ser

TABLE 62

DNA Sequence (SEQ ID NO:259) and Protein Sequence (SEQ ID NO:260) of Ms1.4

tct gat agc agg aat gtc gca atc gag gac aga gtg tct gac ctg cac
Ser Asp Ser Arg Asn Val Ala Ile Glu Asp Arg Val Ser Asp Leu His

tct atg ttc ttc gat gtt tct tgc tgt agc aat cct acc tgt aaa gaa
Ser Met Phe Phe Asp Val Ser Cys Cys Ser Asn Pro Thr Cys Lys Glu

acg tat ggt tgt tgatcgttgg ttttgaagac gctgatgctc caggaccctc
Thr Tyr Gly Cys



TABLE 63 

DNA Sequence (SEQ ID NO:261) and Protein Sequence (SEQ ID NO:262) of Ms1.5

tct gtt ggc agg aat att gca gtc gac gac aga ggg att ttc tct acg
Ser Val Gly Arg Asn Ile Ala Val Asp Asp Arg Gly Ile Phe Ser Thr

ctc ttc cat gct cat tgc tgt gcc aat ccc atc tgt aaa aac acg ccc
Leu Phe His Ala His Cys Cys Ala Asn Pro Ile Cys Lys Asn Thr Pro

ggt tgt tgatctttgt tcttcaaaga cgctgctggc ccaggaccct ctgaaccacg
Gly Cys

acgt



TABLE 64 

DNA Sequence (SEQ ID NO:263) and Protein Sequence (SEQ ID NO:264) of Ms1.8

tcc gat ggc agg aat gtc gca atc gac gac aga gtg tct gac ctg cac
Ser Asp Gly Arg Asn Val Ala Ile Asp Asp Arg Val Ser Asp Leu His

tct atg ttc ttc gat att gct tgc tgt aac aat cct acc tgt aaa gaa
Ser Met Phe Phe Asp Ile Ala Cys Cys Asn Asn Pro Thr Cys Lys Glu

acg tat ggt tgt tgatcgttgg ttttgaagac gctgatgctc caggaccctc
Thr Tyr Gly Cys

tgaaccacga cgt



TABLE 65 

DNA Sequence (SEQ ID NO:265) and Protein Sequence (SEQ ID NO:266) of Ms1.9

tct gat ggc agg aat gtc gca atc gag gac aga gtg tct gac ctg ctc
Ser Asp Gly Arg Asn Val Ala Ile Glu Asp Arg Val Ser Asp Leu Leu

tct atg ctc ttc gat gtt gct tgc tgt agc aat cct gtc tgt aaa gaa
Ser Met Leu Phe Asp Val Ala Cys Cys Ser Asn Pro Val Cys Lys Glu

acg tat ggt tgt tgatcgttgg ttttgaagac gctgatgctc caggaccctc
Thr Tyr Gly Cys

tgaaccacga cgt

TABLE 66

DNA Sequence (SEQ ID NO:267) and Protein Sequence (SEQ ID NO:268) of Bt1.7

tat gat ggc agg aat gct gcc gcc gac gac aaa gct ttt gac ctg ctg
Tyr Asp Gly Arg Asn Ala Ala Ala Asp Asp Lys Ala Phe Asp Leu Leu

gct atg acc ata agg gga gga tgc tgt tcc tat cct ccc tgt atc gcg
Ala Met Thr Ile Arg Gly Gly Cys Cys Ser Tyr Pro Pro Cys Ile Ala

agt aat cct aaa tgt ggt gga aga cgc tgatgctcca ggaccctctg
Ser Asn Pro Lys Cys Gly Gly Arg Arg

aaccacaacg t



TABLE 67 

DNA Sequence (SEQ ID NO:269) and Protein Sequence (SEQ ID NO:270) of Lv1.5

ttt gat ggc agg aat gct gca ggc aac gcc aaa atg tcc gcc ctg atg
Phe Asp Gly Arg Asn Ala Ala Gly Asn Ala Lys Met Ser Ala Leu Met

gcc ctg acc atc agg gga tgc tgt tcc cat cct gtc tgt agc gcg atg
Ala Leu Thr Ile Arg Gly Cys Cys Ser His Pro Val Cys Ser Ala Met

agt cca atc tgt ggc tgaagacgct gatgccccag gaccctctga accacgacgt
Ser Pro Ile Cys Gly



TABLE 68 

DNA Sequence (SEQ ID NO:271) and Protein Sequence (SEQ ID NO:272) of Ms1.10

atc aag aat gct gca gct gac atc gtc agg aat gct gca tcc ctt gcc
Ile Lys Asn Ala Ala Ala Asp Ile Val Arg Asn Ala Ala Ser Leu Ala

ctc agg gga tgc tgt aca tat tgt ggt aga aga cgc
Leu Arg Gly Cys Cys Thr Tyr Cys Gly Arg Arg Arg



TABLE 69 

DNA Sequence (SEQ ID NO:273) and Protein Sequence (SEQ ID NO:274) of Om1.1

tctgatggca ggaatgccgc agcgtctgac ctgatggat ctg acc atc aag gga
Leu Thr Ile Lys Gly

tgc tgt tct tat cct ccc tgt ttc gcg act aat cca gac tgt ggt cga
Cys Cys Ser Tyr Pro Pro Cys Phe Ala Thr Asn Pro Asp Cys Gly Arg

cga cgc tgatgctcca ggaccctctg aaccacgacg t
Arg Arg

TABLE 70

DNA Sequence (SEQ ID NO:275) and Protein Sequence (SEQ ID NO:276) of R1.6

ttt gat ggc agg aat gcc gca gcc gac tac aaa ggg tct gaa ttg ctc
Phe Asp Gly Arg Asn Ala Ala Ala Asp Tyr Lys Gly Ser Glu Leu Leu

gct atg acc gtc agg gga gga tgc tgt tcc tat cct ccc tgt atc gca
Ala Met Thr Val Arg Gly Gly Cys Cys Ser Tyr Pro Pro Cys Ile Ala

aat aat cct ctt tgt gct gga aga cgc tga
Asn Asn Pro Leu Cys Ala Gly Arg Arg

















TABLE 71 

DNA Sequence (SEQ ID NO:277) and Protein Sequence (SEQ ID NO:278) of R1.7

ttt gat ggc agg aat gcc gca gcc gac tac aaa ggg tct gaa ttg ctc
Phe Asp Gly Arg Asn Ala Ala Ala Asp Tyr Lys Gly Ser Glu Leu Leu

gct atg acc gtc agg gga gga tgc tgt tcc tat cct ccc tgt atc gca
Ala Met Thr Val Arg Gly Gly Cys Cys Ser Tyr Pro Pro Cys Ile Ala

aat aat cct ttt tgt gct gga aga cgc tga
Asn Asn Pro Phe Cys Ala Gly Arg Arg

















TABLE 72 

DNA Sequence (SEQ ID NO:279) and Protein Sequence (SEQ ID NO:280) of Vr1.1

tct tat gac agg tat gcc tcg ccc gtc gac aga gcg tct gcc ctg atc
Ser Tyr Asp Arg Tyr Ala Ser Pro Val Asp Arg Ala Ser Ala Leu Ile

gct cag gcc atc ctt cga gat tgc tgt tcc aat cct ccc tgt tcc caa
Ala Gln Ala Ile Leu Arg Asp Cys Cys Ser Asn Pro Pro Cys Ser Gln

aat aat cca gac tgt atg taaagacgct gcttgctcca ggaccctctg
Asn Asn Pro Asp Cys Met

aaccacgacg t



TABLE 73 

DNA Sequence (SEQ ID NO:281) and Protein Sequence (SEQ ID NO:282) of Vr1.2

tct tat ggc agg tat gcc tca ccc gtc gac aga gcg tct gcc ctg atc
Ser Tyr Gly Arg Tyr Ala Ser Pro Val Asp Arg Ala Ser Ala Leu Ile

gct cag gcc atc ctt cga gat tgc tgc tcc aat cct cct tgt gcc cat
Ala Gln Ala Ile Leu Arg Asp Cys Cys Ser Asn Pro Pro Cys Ala His

aat aat cca gac tgt cgt taaagacgct gcttgctcca ggaccctctg
Asn Asn Pro Asp Cys Arg

aaccacgacg t

TABLE 74

DNA Sequence (SEQ ID NO:283) and Protein Sequence (SEQ ID NO:284) of A1.4

tct gat ggc agg aat gcc gca gcc aac gac aaa gcg tct ggc atg agc
Ser Asp Gly Arg Asn Ala Ala Ala Asn Asp Lys Ala Ser Gly Met Ser

gcg ctg gcc gtc aat gaa tgc tgt acc aac cct gtc tgt cac gcg gaa
Ala Leu Ala Val Asn Glu Cys Cys Thr Asn Pro Val Cys His Ala Glu

cat caa gaa ctt tgt gct aga aga cgc tga
His Gln Glu Leu Cys Ala Arg Arg Arg

















TABLE 75 

DNA Sequence (SEQ ID NO:285) and Protein Sequence (SEQ ID NO:286) of A1.5

tct gat ggc agg aat gcc gca gcc aac gac aaa gcg tct gac gtg atc
Ser Asp Gly Arg Asn Ala Ala Ala Asn Asp Lys Ala Ser Asp Val Ile

acg ctg gcc ctc aag gga tgc tgt tcc aac cct gtc tgt cac ttg gag
Thr Leu Ala Leu Lys Gly Cys Cys Ser Asn Pro Val Cys His Leu Glu

cat tca aac ctt tgt ggt aga aga cgc tga
His Ser Asn Leu Cys Gly Arg Arg Arg

TABLE 76 

DNA Sequence (SEQ ID NO:287) and Protein Sequence (SEQ ID NO:288) of A1.6

tct gat ggc agg aat gcc gca gcc aac gac aaa gcg tct ggc atg agc
Ser Asp Gly Arg Asn Ala Ala Ala Asn Asp Lys Ala Ser Gly Met Ser

gcg ctg gcc gtc aat gaa tgc tgt acc aac cct gtc tgt cac gtg gaa
Ala Leu Ala Val Asn Glu Cys Cys Thr Asn Pro Val Cys His Val Glu

cat caa gaa ctt tgt gct aga aga cgc tga
His Gln Glu Leu Cys Ala Arg Arg Arg

TABLE 77

DNA Sequence (SEQ ID NO:289) and Protein Sequence (SEQ ID NO:290) of Af1.1

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc acc gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc act tca gat cgt gca ttt cgt ggc agg aat gcc gca gcc aaa gcg
Phe Thr Ser Asp Arg Ala Phe Arg Gly Arg Asn Ala Ala Ala Lys Ala

tct ggc ctg gtc ggt ctg acc gac aag agg caa gaa tgc tgt tct tat
Ser Gly Leu Val Gly Leu Thr Asp Lys Arg Gln Glu Cys Cys Ser Tyr

cct gcc tgt aac cta gat cat cca gaa ctt tgt ggt tgaagacgct
Pro Ala Cys Asn Leu Asp His Pro Glu Leu Cys Gly

gatgctccag gaccctctga accacgacgt

TABLE 78

DNA Sequence (SEQ ID NO:291) and Protein Sequence (SEQ ID NO:292) of Af1.2

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc act gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

tcc act tca ggt cgt cgt gca ttt cgt ggc agg aat gcc gca gcc aaa
Ser Thr Ser Gly Arg Arg Ala Phe Arg Gly Arg Asn Ala Ala Ala Lys

gcg tct gga ctg gtc ggt ctg act gac agg aga cca gaa tgc tgt agt
Ala Ser Gly Leu Val Gly Leu Thr Asp Arg Arg Pro Glu Cys Cys Ser

gat cct cgc tgt aac tcg act cat cca gaa ctt tgt ggt gga aga cgc
Asp Pro Arg Cys Asn Ser Thr His Pro Glu Leu Cys Gly Gly Arg Arg

tgatgctcca ggaccctctg aaccacgacg t



TABLE 79

DNA Sequence (SEQ ID NO:293) and Protein Sequence (SEQ ID NO:294) of Ar1.2

tct gat ggc agg aat gcc gca gcc aac gcg ttt gac ctg atc gat ctg
Ser Asp Gly Arg Asn Ala Ala Ala Asn Ala Phe Asp Leu Ile Asp Leu

acc gcc agg cta aat tgc tgt atg att ccc ccc tgt tgg aag aaa tat
Thr Ala Arg Leu Asn Cys Cys Met Ile Pro Pro Cys Trp Lys Lys Tyr

gga gac aga tgt agt gaa gta cgc tgatgctcca ggaccctctg aaccacgacg t
Gly Asp Arg Cys Ser Glu Val Arg

TABLE 80

DNA Sequence (SEQ ID NO:295) and Protein Sequence (SEQ ID NO:296) of Ar1.3

tct gat ggc agg aat gcc gca cgc aaa gcg ttt ggc tgc tgc gac tta
Ser Asp Gly Arg Asn Ala Ala Arg Lys Ala Phe Gly Cys Cys Asp Leu

ata ccc tgt ttg gag aga tat ggt aac aga tgt aat gaa gtg cac
Ile Pro Cys Leu Glu Arg Tyr Gly Asn Arg Cys Asn Glu Val His

tgatgctcca ggaccctctg aaccacgcga cgt

TABLE 81

DNA Sequence (SEQ ID NO:297) and Protein Sequence (SEQ ID NO:298) of Ar1.4

tct gat ggc agc aat gcc gca gcc aac gag ttt gac ctg atc gct ctg
Ser Asp Gly Ser Asn Ala Ala Ala Asn Glu Phe Asp Leu Ile Ala Leu

acc gcc agg cta ggt tgc tgt aac gtt aca ccc tgt tgg gag aaa tat
Thr Ala Arg Leu Gly Cys Cys Asn Val Thr Pro Cys Trp Glu Lys Tyr

gga gac aaa tgt aat gaa gta cgc tgatgcttca ggaccctctg aaccacgacg t
Gly Asp Lys Cys Asn Glu Val Arg

TABLE 82

DNA Sequence (SEQ ID NO:299) and Protein Sequence (SEQ ID NO:300) of Ar1.5

tct gat ggc agg aat gtc gca gca aaa gcg ttt cac cgg atc ggc cgg
Ser Asp Gly Arg Asn Val Ala Ala Lys Ala Phe His Arg Ile Gly Arg

acc atc agg gat gaa tgc tgt tcc aat cct gcc tgt agg gtg aat aat
Thr Ile Arg Asp Glu Cys Cys Ser Asn Pro Ala Cys Arg Val Asn Asn

cca cac gtt tgt aga cga cgc tgatgctcca ggaccctctg aaccacgacg t
Pro His Val Cys Arg Arg Arg

TABLE 83 

DNA Sequence (SEQ ID NO:301) and Protein Sequence (SEQ ID NO:302) of Ar1.6

tct gat ggc agg aat gcc gca gcc aac gcg ttt gac ctg atg cct ctg
Ser Asp Gly Arg Asn Ala Ala Ala Asn Ala Phe Asp Leu Met Pro Leu

acc gcc agg cta aat tgc tgt agc att ccc ggc tgt tgg aac gaa tat
Thr Ala Arg Leu Asn Cys Cys Ser Ile Pro Gly Cys Trp Asn Glu Tyr

aaa gac aga tgt agt aaa gta cgc tgatgctcca ggaccctctg aaccacgacg t
Lys Asp Arg Cys Ser Lys Val Arg

TABLE 84 

DNA Sequence (SEQ ID NO:303) and Protein Sequence (SEQ ID NO:304) of Ay1.2

tctgatggca ggaatgccgc agccgacgac aaagcgtctg acctggtcgc t ctg gtc
                                                         Leu Val

gtc agg gga gga tgc tgt tcc cac cct gtc tgt tac ttt aat aat cca
Val Arg Gly Gly Cys Cys Ser His Pro Val Cys Tyr Phe Asn Asn Pro

caa atg tgt cgt gga aga cgc tgatgctcca ggaccctctg aaccacgacg t
Gln Met Cys Arg Gly Arg Arg

TABLE 85 

DNA Sequence (SEQ ID NO:305) and Protein Sequence (SEQ ID NO:306) of Ay1.3

tctgatggca ggaatgccgc agccgacgac aaagcgtctg acctggtcgc t ctg gcc
                                                         Leu Ala

gtc agg gga gga tgc tgt tcc cac cct gtc tgt aac ttg aat aat cca
Val Arg Gly Gly Cys Cys Ser His Pro Val Cys Asn Leu Asn Asn Pro

caa atg tgt cgt gga aga cgc tgatgctcca ggaccctctg aaccacgacg t
Gln Met Cys Arg Gly Arg Arg

TABLE 86 

DNA Sequence (SEQ ID NO:307) and Protein Sequence (SEQ ID NO:308) of Bt1.8

ttt cgt ggc agg aat ccc gca gcc aac gac aaa agg tct gac ctg gcc
Phe Arg Gly Arg Asn Pro Ala Ala Asn Asp Lys Arg Ser Asp Leu Ala

gct ctg agc gtc agg gga gga tgc tgt tcc cat cct gcc tgt agc gtg
Ala Leu Ser Val Arg Gly Gly Cys Cys Ser His Pro Ala Cys Ser Val

act cat cca gag ctt tgt ggc tgaagacgct gatgccccag gaccctctga
Thr His Pro Glu Leu Cys Gly

accacgacgt



TABLE 87 

DNA Sequence (SEQ ID NO:309) and Protein Sequence (SEQ ID NO:310) of Bt1.9

tct gat ggc ggg aat gcc gca gcc aaa gcg tct gac ctg atc gct cag
Ser Asp Gly Gly Asn Ala Ala Ala Lys Ala Ser Asp Leu Ile Ala Gln

acc atc agg gga gga tgc tgt tcc tat cct gcc tgt agc gtg gaa cat
Thr Ile Arg Gly Gly Cys Cys Ser Tyr Pro Ala Cys Ser Val Glu His

caa gac ctt tgt gat gga aga cgc tgatgctcca ggaccctctg aaccacgacg t
Gln Asp Leu Cys Asp Gly Arg Arg

TABLE 88 

DNA Sequence (SEQ ID NO:311) and Protein Sequence (SEQ ID NO:312) of Ca1.3

tct tat ggc agg aat gcc gca gcc aaa gcg ttt gaa gtg agt tgc tgt
Ser Tyr Gly Arg Asn Ala Ala Ala Lys Ala Phe Glu Val Ser Cys Cys

gtc gtt cgc ccc tgt tgg att cgc tat caa gag gaa tgt ctt gaa gca
Val Val Arg Pro Cys Trp Ile Arg Tyr Gln Glu Glu Cys Leu Glu Ala

gat ccc agg acc ctc tga
Asp Pro Arg Thr Leu



TABLE 89 

DNA Sequence (SEQ ID NO:313) and Protein Sequence (SEQ ID NO:314) of Ca1.4

tct gat ggc agg aat gcc gca gcc aac gcc ctt gac ctg atc act ctg
Ser Asp Gly Arg Asn Ala Ala Ala Asn Ala Leu Asp Leu Ile Thr Leu

atc gcc agg caa aat tgc tgt agc att ccc ggc tgt tgg gag aaa tat
Ile Ala Arg Gln Asn Cys Cys Ser Ile Pro Gly Cys Trp Glu Lys Tyr

gga gac aaa tgt agt gaa gta cgc tga
Gly Asp Lys Cys Ser Glu Val Arg

TABLE 90 

DNA Sequence (SEQ ID NO:315) and Protein Sequence (SEQ ID NO:316) of C1.2

tct gat ggc agg aat gaa gca gcc aac gac gaa gcg tct gac gtg atc
Ser Asp Gly Arg Asn Glu Ala Ala Asn Asp Glu Ala Ser Asp Val Ile

gag ctg gcc ctc aag gga tgc tgt tcc aac cct gtc tgt cac ttg gag
Glu Leu Ala Leu Lys Gly Cys Cys Ser Asn Pro Val Cys His Leu Glu

cat cca aac gct tgt ggt aga aga cgc tgatgctcca ggaccctctg
His Pro Asn Ala Cys Gly Arg Arg Arg

aaccacgacg t



TABLE 91 

DNA Sequence (SEQ ID NO:317) and Protein Sequence (SEQ ID NO:318) of C1.3

tct gat ggc agg aat gcc gca gcc aac gac aaa gcg tct gac ctg gtc
Ser Asp Gly Arg Asn Ala Ala Ala Asn Asp Lys Ala Ser Asp Leu Val

gct ctg gcc gtc agg gga tgc tgt tcc aac cct atc tgt tac ttt aat
Ala Leu Ala Val Arg Gly Cys Cys Ser Asn Pro Ile Cys Tyr Phe Asn

aat cca cga att tgt cgt gga aga cgc tgatgctcca ggaccctctg
Asn Pro Arg Ile Cys Arg Gly Arg Arg

aaccacgacg t



TABLE 92

DNA Sequence (SEQ ID NO:319) and Protein Sequence (SEQ ID NO:320) of Ep1.2

tct cat ggc agg aat gcc gca cgc aaa gcg tct gac ctg atc gct ctg
Ser His Gly Arg Asn Ala Ala Arg Lys Ala Ser Asp Leu Ile Ala Leu

acc gtc agg gaa tgc tgt tct cag cct ccc tgt cgc tgg aaa cat cca
Thr Val Arg Glu Cys Cys Ser Gln Pro Pro Cys Arg Trp Lys His Pro

gaa ctt tgt agt tga
Glu Leu Cys Ser

TABLE 93 

DNA Sequence (SEQ ID NO:321) and Protein Sequence (SEQ ID NO:322) of G1.1

tct gat ggc agg aat gac gca gcc aaa gcg ttt gac ctg ata tct tcg
Ser Asp Gly Arg Asn Asp Ala Ala Lys Ala Phe Asp Leu Ile Ser Ser

acc gtc aag aaa gga tgc tgt tcc cat cct gcc tgt gcg ggg aat aat
Thr Val Lys Lys Gly Cys Cys Ser His Pro Ala Cys Ala Gly Asn Asn

caa cat att tgt ggc cga aga cgc tgatgctcca ggaccctctg aaccacgacg t
Gln His Ile Cys Gly Arg Arg Arg

TABLE 94 

DNA Sequence (SEQ ID NO:323) and Protein Sequence (SEQ ID NO:324) of G1.3 

tct gat ggc agg aat gcc gca gcc aac gac caa gcg tct gac ctg atg
Ser Asp Gly Arg Asn Ala Ala Ala Asn Asp Gln Ala Ser Asp Leu Met

TABLE 95

DNA Sequence (SEQ ID NO:325) and Protein Sequence (SEQ ID NO:326) of Im1.3



TABLE 96 

DNA Sequence (SEQ ID NO:327) and Protein Sequence (SEQ ID NO:328) of Lv1.2





get 
Ala 


gcg acc 
Ala Thr 




aat 
Asn 


cca gac 
Pro Asp 


5 


aaccacgacg t 






ocC|uence 




ett 
Lb u 


gat gaa 

i-lSp ^jJLU 


10 


get 
Ala 


caa ate 
Gin He 




aat 
Asn 


aat cca 
Asn Pro 


15 


UJN A 


Sequence 




tct, 
Ser 


gat ggc 




gag 

o J_ U 


gaa tgc 
o J- u, y £3 


20 


tgt 
Cys 


ggc gga 
Gly Gly 




DNA 


Sequence 


25 


tet 
Ser 


aat ggc 
Asn Gly 




aag 
Lys 


egg acc 
Arg Thr 




aaa 
Lys 


cat caa 
His Gin 


30 


aeeaegacgt 
DNA Sequence 




tet 
Ser 


aat ggc 
Asn Gly 


35 


gag 
Glu 


ctg acc 
Leu Thr 



TABLE 97 



TABLE 98 

gga cat caa gac ctg tgt ggc tgaagacgct gatgctccag gaccctctga
Gly His Gln Asp Leu Cys Gly

accacgacgt 

TABLE 99 

DNA Sequence (SEQ ID NO:333) and Protein Sequence (SEQ ID NO:334) of Lv1.6

gca ttt gat ggc agg aat gct gca gcc agc gac aaa gcg tcc gag ctg
Ala Phe Asp Gly Arg Asn Ala Ala Ala Ser Asp Lys Ala Ser Glu Leu

atg gct ctg gcc gtc agg gga tgc tgt tcc cat cct gcc tgt gct ggg
Met Ala Leu Ala Val Arg Gly Cys Cys Ser His Pro Ala Cys Ala Gly

agt aat gca cat atc tgt ggc aga aga cgc tgatgctcca ggaccctctg
Ser Asn Ala His Ile Cys Gly Arg Arg Arg

aaccacgacg t

TABLE 100 

DNA Sequence (SEQ ID NO:335) and Protein Sequence (SEQ ID NO:336) of Lv1.7

tct aat ggc agg aat gcc gca gcc aaa ttc aaa gcg cct gcc ctg atg
Ser Asn Gly Arg Asn Ala Ala Ala Lys Phe Lys Ala Pro Ala Leu Met

aag ctg acc gtc agg gag gat tgc tgt tca gac cct cgc tgt tcc gtg
Lys Leu Thr Val Arg Glu Asp Cys Cys Ser Asp Pro Arg Cys Ser Val

gga cat caa gac atg tgt ggc tgaagacgct gatgctccag gaccctctga
Gly His Gln Asp Met Cys Gly

atcacgacgt

TABLE 101 

DNA Sequence (SEQ ID NO:337) and Protein Sequence (SEQ ID NO:338) of Lv1.8

ttt gaa tgc agg aat gct gca ggc aac gac aaa gcg act gac ctg atg
Phe Glu Cys Arg Asn Ala Ala Gly Asn Asp Lys Ala Thr Asp Leu Met

gct ctg act gtc agg gga tgc tgt tcc cat cct gcc tgt gct ggg aat
Ala Leu Thr Val Arg Gly Cys Cys Ser His Pro Ala Cys Ala Gly Asn

aat cca cat atc tgc ggc tgaagacgct gatgctccag gaccctctga
Asn Pro His Ile Cys Gly

accacgacgt 

TABLE 102 

DNA Sequence (SEQ ID NO:339) and Protein Sequence (SEQ ID NO:340) of Lv1.9

ttt gat ggc agg aac gcc gca gcc aac aac aaa gcg act gat ctg atg
Phe Asp Gly Arg Asn Ala Ala Ala Asn Asn Lys Ala Thr Asp Leu Met

gct ctg act gtc aga gga tgc tgt ggc aat cct tca tgt agc atc cat
Ala Leu Thr Val Arg Gly Cys Cys Gly Asn Pro Ser Cys Ser Ile His

att cct tac gtt tgt aat tagagacact gatgctccag gaccctctga
Ile Pro Tyr Val Cys Asn

accacgacgt 



TABLE 103 

DNA Sequence (SEQ ID NO:341) and Protein Sequence (SEQ ID NO:342) of Lv1.10

tct aat ggc agg aat gcc gca gcc aaa ttc aaa gcg cct gcc ctg atg
Ser Asn Gly Arg Asn Ala Ala Ala Lys Phe Lys Ala Pro Ala Leu Met

aag cgg acc gac agc gaa gaa tgc tgt tta gac tct cgc tgt gcc ggg
Lys Arg Thr Asp Ser Glu Glu Cys Cys Leu Asp Ser Arg Cys Ala Gly

caa cat caa gac ctg tgt ggc gga aga cgc tgatgctcca ggaccctctg
Gln His Gln Asp Leu Cys Gly Gly Arg Arg

aaccacgacg t 

TABLE 104 

DNA Sequence (SEQ ID NO:343) and Protein Sequence (SEQ ID NO:344) of Mr1.3

tct gat ggc agg aat gcc gca gcc aag gac aaa gcg tct gac ctg gtc
Ser Asp Gly Arg Asn Ala Ala Ala Lys Asp Lys Ala Ser Asp Leu Val

gct ctg acc gtc aag gga tgc tgt tct aat cct ccc tgt tac gcg aat
Ala Leu Thr Val Lys Gly Cys Cys Ser Asn Pro Pro Cys Tyr Ala Asn

aat caa gcc tat tgt aat gga aga cgc tga
Asn Gln Ala Tyr Cys Asn Gly Arg Arg



TABLE 105 

DNA Sequence (SEQ ID NO:345) and Protein Sequence (SEQ ID NO:346) of Mr1.4

tct gat ggc agg aat gcc gca gcc aag gac aaa gcg tct gac ctg gtc
Ser Asp Gly Arg Asn Ala Ala Ala Lys Asp Lys Ala Ser Asp Leu Val

gct ctg acc gtc aag gga tgc tgt tct cat cct gcc tgt agc gtg aat
Ala Leu Thr Val Lys Gly Cys Cys Ser His Pro Ala Cys Ser Val Asn

aat cca gac att tgt ggt tga
Asn Pro Asp Ile Cys Gly

TABLE 106 

DNA Sequence (SEQ ID NO:347) and Protein Sequence (SEQ ID NO:348) of Ms1.1

tct gat ggc agg aat gct gca gcc aac aac aaa gtg gct ttg acc atg
Ser Asp Gly Arg Asn Ala Ala Ala Asn Asn Lys Val Ala Leu Thr Met

agg gga aaa tgc tgt atc aat gat gcg tgt cgc tcg aaa cat cca cag
Arg Gly Lys Cys Cys Ile Asn Asp Ala Cys Arg Ser Lys His Pro Gln

tac tgt tct gga aga cgc tgatactcca ggaccctctg aaccacgacg t
Tyr Cys Ser Gly Arg Arg

TABLE 107

DNA Sequence (SEQ ID NO:349) and Protein Sequence (SEQ ID NO:350) of Ms1.6

tct gat ggc agg aat gct gca gcc aac gac aaa gtg tct gac cag atg
Ser Asp Gly Arg Asn Ala Ala Ala Asn Asp Lys Val Ser Asp Gln Met

gct ctg gtt gtc agg gga tgc tgt tac aat att gcc tgt aga att aat
Ala Leu Val Val Arg Gly Cys Cys Tyr Asn Ile Ala Cys Arg Ile Asn

aat cca cgg tac tgt cgt gga aaa cgc tgatgttcca ggaccctctg
Asn Pro Arg Tyr Cys Arg Gly Lys Arg

aaccacgacg t



TABLE 108 

DNA Sequence (SEQ ID NO:351) and Protein Sequence (SEQ ID NO:352) of O1.1

tctgaaggca ggaatgccgc agccaacgac aaagcgtctg acctgatggc t ctg aac
                                                         Leu Asn

gtc agg gga tgc tgt tcc cat cct gtc tgt cgc ttc aat tat cca aaa
Val Arg Gly Cys Cys Ser His Pro Val Cys Arg Phe Asn Tyr Pro Lys

tat tgt ggt gga aga cgc tgatggtcca ggaccctctg aaccacgacg t
Tyr Cys Gly Gly Arg Arg



TABLE 109 

DNA Sequence (SEQ ID NO:353) and Protein Sequence (SEQ ID NO:354) of O1.2

tctgatggcg ggaatgccgc agcaaaagcg tttgatctaa tcact ctg gcc ctc agg
                                                  Leu Ala Leu Arg

gat gaa tgc tgt gcc agt cct ccc tgt cgt ttg aat aat cca tac gta
Asp Glu Cys Cys Ala Ser Pro Pro Cys Arg Leu Asn Asn Pro Tyr Val

tgt cat tgacgacgct gatgctccag gaccctctga accacgacgt
Cys His



TABLE 110 

DNA Sequence (SEQ ID NO:355) and Protein Sequence (SEQ ID NO:356) of O1.4

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc acc gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ccc act tca gat cgt gca tct gat agg agg aat gcc gca gcc aaa gcg
Pro Thr Ser Asp Arg Ala Ser Asp Arg Arg Asn Ala Ala Ala Lys Ala

ttt gac ctg aga tat tcg acc gcc aag aga gga tgc tgt tcc aat cct
Phe Asp Leu Arg Tyr Ser Thr Ala Lys Arg Gly Cys Cys Ser Asn Pro

gtc tgt tgg cag aat aat gca gaa tac tgt cgt gaa agt ggc
Val Cys Trp Gln Asn Asn Ala Glu Tyr Cys Arg Glu Ser Gly

taatgctcca ggaccctctg aaccacgacg t

TABLE 111

DNA Sequence (SEQ ID NO:357) and Protein Sequence (SEQ ID NO:358) of O1.7

atg ttc acc gtg ttt ctg ttg gtt gtc ttg gca acc acc gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ala Thr Thr Val Val Ser

ttc act tca gat cgt gca tct gat ggc ggg aat gtc gca gcg tct cac
Phe Thr Ser Asp Arg Ala Ser Asp Gly Gly Asn Val Ala Ala Ser His

ctg atc gct ctg acc atc aag gga tgc tgt tct cac cct ccc tgt gcc
Leu Ile Ala Leu Thr Ile Lys Gly Cys Cys Ser His Pro Pro Cys Ala

cag aat aat caa gac tat tgt ggt tgacgacgct gatgctccag gaccctctga
Gln Asn Asn Gln Asp Tyr Cys Gly

accacgacgt

TABLE 112 

DNA Sequence (SEQ ID NO:359) and Protein Sequence (SEQ ID NO:360) of O1.8

atg ttc acc gtg ttt ctg ttg gtt gtc tta tca acc acc gtc gtt tcc
Met Phe Thr Val Phe Leu Leu Val Val Leu Ser Thr Thr Val Val Ser

tcc act tca gat cgt gca tct gat agg agg aat gcc gca gcc aaa gcg
Ser Thr Ser Asp Arg Ala Ser Asp Arg Arg Asn Ala Ala Ala Lys Ala

tct gac ctg atg tat tcg acc gtc aag aaa gga tgt tgt tcc cat cct
Ser Asp Leu Met Tyr Ser Thr Val Lys Lys Gly Cys Cys Ser His Pro

gcc tgt tcg ggg aat aat cga gaa tat tgt cgt gaa agt ggc
Ala Cys Ser Gly Asn Asn Arg Glu Tyr Cys Arg Glu Ser Gly

taatgctcca ggaccctctg aaccacgacg t



TABLE 113 

DNA Sequence (SEQ ID NO:361) and Protein Sequence (SEQ ID NO:362) of Om1.2

tttgatggca ggaatgcctc agccgacagc aaagtggctg cccggatcgc t cag atc
                                                         Gln Ile

gac agg gat cca tgc tgt tcc tat cct gac tgt ggc gcg aat cat cca
Asp Arg Asp Pro Cys Cys Ser Tyr Pro Asp Cys Gly Ala Asn His Pro

gag att tgt ggt gga aaa cgc tgatgctcca ggaccctctg aaccacgacg t
Glu Ile Cys Gly Gly Lys Arg



TABLE 114 

DNA Sequence (SEQ ID NO:363) and Protein Sequence (SEQ ID NO:364) of Om1.3

tctcatggca ggaatgccgc acgct ctg acc gtc agg gaa tgc tgt tct cag
                            Leu Thr Val Arg Glu Cys Cys Ser Gln

cct cct tgt cgc tgg aaa cat cca gaa ctt tgt agt tgaagacgct
Pro Pro Cys Arg Trp Lys His Pro Glu Leu Cys Ser

gatgctccag gaccctctga accacgacgt

TABLE 115 

DNA Sequence (SEQ ID NO:365) and Protein Sequence (SEQ ID NO:366) of Om1.4

tttgatggca ggaatgctgc agccagcgac aaagcgtctg agctgatggc t ctg gcc
                                                         Leu Ala

gtc agg gga tgc tgt tcc cat cct gcc tgt gct ggg aat aat cca cat
Val Arg Gly Cys Cys Ser His Pro Ala Cys Ala Gly Asn Asn Pro His

atc tgt ggc aga aga cgc tgatgctcca ggaccctctg aaccacgacg t
Ile Cys Gly Arg Arg Arg



TABLE 116 

DNA Sequence (SEQ ID NO:367) and Protein Sequence (SEQ ID NO:368) of Om1.5

tctggtgtca ggaaagacgc agcgcctggc ctgatcgct ctg acc atc aag gga
                                           Leu Thr Ile Lys Gly

tgc tgt tct gat cct agc tgt aac gtg aat aat cca gac tat tgt ggt
Cys Cys Ser Asp Pro Ser Cys Asn Val Asn Asn Pro Asp Tyr Cys Gly

tgacgacgct gatgctccag gaccctctga accacgacgt 



TABLE 117 

DNA Sequence (SEQ ID NO:369) and Protein Sequence (SEQ ID NO:370) of Om1.6

tctaatggca ggaatgccgc agccaaattc aaagcgcctg ccctgatgga g ctg acc
                                                         Leu Thr

gtc agg gaa gaa tgc tgt tca gac cct cgc tgt tcc gtg gga cat caa
Val Arg Glu Glu Cys Cys Ser Asp Pro Arg Cys Ser Val Gly His Gln

gat atg tgt cgg tgaagcacgt gatgctccag gaccctctga accacgacgt
Asp Met Cys Arg



TABLE 118 

DNA Sequence (SEQ ID NO:371) and Protein Sequence (SEQ ID NO:372) of P1.4

act gat ggc agg aat gct gca gcc ata gcg ctt gac ctg atc gct ccg
Thr Asp Gly Arg Asn Ala Ala Ala Ile Ala Leu Asp Leu Ile Ala Pro

gcc gtc agg gga gga tgc tgt tcc aat cct gcc tgt tta gtg aat cat
Ala Val Arg Gly Gly Cys Cys Ser Asn Pro Ala Cys Leu Val Asn His

cta gaa atg tgt ggt aaa aga cgc tgatgcccca ggaccctctg aaccacgacg
Leu Glu Met Cys Gly Lys Arg Arg

TABLE 119

DNA Sequence (SEQ ID NO:373) and Protein Sequence (SEQ ID NO:374) of P1.5

tct gat ggc agg gat gcc gca gcc aac gac aaa gcg tct gac ctg atc
Ser Asp Gly Arg Asp Ala Ala Ala Asn Asp Lys Ala Ser Asp Leu Ile

gct ctg acc gcc agg aga gat cca tgc tgt ttc aat cct gcc tgt aac
Ala Leu Thr Ala Arg Arg Asp Pro Cys Cys Phe Asn Pro Ala Cys Asn

gtg aat aat cca cag att tgt ggt tgaagacgct gatgctccag gaccctctga
Val Asn Asn Pro Gln Ile Cys Gly

accacgacgt

TABLE 120

DNA Sequence (SEQ ID NO:375) and Protein Sequence (SEQ ID NO:376) of P1.6

tct gat ggc agg gat gct gag aaa aca ggc ttt gac acg acc att gtg
Ser Asp Gly Arg Asp Ala Glu Lys Thr Gly Phe Asp Thr Thr Ile Val

ccg gaa gac tgc tgt tcg gat cct tcc tgt tgg agg ctg cat agt tta
Pro Glu Asp Cys Cys Ser Asp Pro Ser Cys Trp Arg Leu His Ser Leu

gct tgt act gga att gta aac cgc tgatgctcca ggaccctctg aaccacgacg t
Ala Cys Thr Gly Ile Val Asn Arg

TABLE 121

DNA Sequence (SEQ ID NO:377) and Protein Sequence (SEQ ID NO:378) of P1.8

act gat ggc agg agt gct gca gcc ata gcg ttt gcc ctg atc gct ccg
Thr Asp Gly Arg Ser Ala Ala Ala Ile Ala Phe Ala Leu Ile Ala Pro

acc gtc tgc tgt act aat cct gcc tgt ctc gtg aat aat ata cgc ttt
Thr Val Cys Cys Thr Asn Pro Ala Cys Leu Val Asn Asn Ile Arg Phe

tgt ggt gga aga cgc tgatgcccca ggaccctctg aaccacgacg t
Cys Gly Gly Arg Arg

TABLE 122

DNA Sequence (SEQ ID NO:379) and Protein Sequence (SEQ ID NO:380) of Rg1.1

tct gat gga aga aat gcc gca agc gac gcc aaa gcg ttt ccc cgg atc
Ser Asp Gly Arg Asn Ala Ala Ser Asp Ala Lys Ala Phe Pro Arg Ile

gct cca atc gtc agg gac gaa tgc tgt agc gat cct agg tgt cac ggg
Ala Pro Ile Val Arg Asp Glu Cys Cys Ser Asp Pro Arg Cys His Gly

aat aat cgg gac cac tgt gct tgaagacgct gctgctccag gaccctctga
Asn Asn Arg Asp His Cys Ala

accacgacgt



TABLE 123 

DNA Sequence (SEQ ID NO:381) and Protein Sequence (SEQ ID NO:382) of Rg1.3

tct gat ggc agg aat acc gcg gcc gac gaa aaa gcg tcc gac ctg atc
Ser Asp Gly Arg Asn Thr Ala Ala Asp Glu Lys Ala Ser Asp Leu Ile

tct caa act gtc aag aga gat tgc tgt tcc cat cct ctc tgt aga tta
Ser Gln Thr Val Lys Arg Asp Cys Cys Ser His Pro Leu Cys Arg Leu

ttt gtt cca gga ctt tgt att tgaagacgct gctgctccag gaccctctga
Phe Val Pro Gly Leu Cys Ile

accacgact 



TABLE 124 

DNA Sequence (SEQ ID NO:383) and Protein Sequence (SEQ ID NO:384) of Rg1.4

tct gat ggc agg aat gcc gca gcc gac aac aaa gcg tct gac cta atc
Ser Asp Gly Arg Asn Ala Ala Ala Asp Asn Lys Ala Ser Asp Leu Ile

gct caa atc gtc agg aga gga tgc tgt tcc cat cct gtc tgt aaa gtg
Ala Gln Ile Val Arg Arg Gly Cys Cys Ser His Pro Val Cys Lys Val

agg tat cca gac ctg tgt cgt tgaagacgct gctgctccag gaccctctga
Arg Tyr Pro Asp Leu Cys Arg

accacgacgt 



TABLE 125 

DNA Sequence (SEQ ID NO:385) and Protein Sequence (SEQ ID NO:386) of Rg1.5

tct gat ggc agg aat gcc gca gcc gac aac aga gcg tct gac cta atc
Ser Asp Gly Arg Asn Ala Ala Ala Asp Asn Arg Ala Ser Asp Leu Ile

gct caa atc gtc agg aga gga tgc tgt tcc cat cct gcc tgt aat gtg
Ala Gln Ile Val Arg Arg Gly Cys Cys Ser His Pro Ala Cys Asn Val

aat aat cca cac att tgt ggt tgaagacgct gctgctccag gaccctctga
Asn Asn Pro His Ile Cys Gly

accacgacgt



TABLE 126 

DNA Sequence (SEQ ID NO:387) and Protein Sequence (SEQ ID NO:388) of Rg1.8

tct gat ggc agg aat gcc gca gcc gac aac aaa ccg tct gac cta atc
Ser Asp Gly Arg Asn Ala Ala Ala Asp Asn Lys Pro Ser Asp Leu Ile

gct caa atc gtc agg aga gga tgc tgt tcg cat cct gtc tgt aaa gtg
Ala Gln Ile Val Arg Arg Gly Cys Cys Ser His Pro Val Cys Lys Val

agg tat tca gac atg tgt ggt tgaagacgct gctgctccag gaccctctga
Arg Tyr Ser Asp Met Cys Gly

accacgacgt 

TABLE 127

DNA Sequence (SEQ ID NO:389) and Protein Sequence (SEQ ID NO:390) of Sm1.4

tct gat ggc agg aat gca gag cga cga caa agc gtc tgt cct ggt cgc
Ser Asp Gly Arg Asn Ala Glu Arg Arg Gln Ser Val Cys Pro Gly Arg

tct ggc ccc agg gga gga tgt tgt tcc cac cct gcc tgt aag gtg cat
Ser Gly Pro Arg Gly Gly Cys Cys Ser His Pro Ala Cys Lys Val His

ttt cca cac agt tgt ggt tgacgacgct gatgctccag gaccctctga
Phe Pro His Ser Cys Gly

accacgacgt 



TABLE 128 

DNA Sequence (SEQ ID NO:391) and Protein Sequence (SEQ ID NO:392) of Sm1.5

tct gat ggc agg aat gcc gca gcc agc gac aga gcg tct gac gcg gcc
Ser Asp Gly Arg Asn Ala Ala Ala Ser Asp Arg Ala Ser Asp Ala Ala

cac cag gta tgc tgt tcc aac cct gtc tgt cac gtg gat cat cca gaa
His Gln Val Cys Cys Ser Asn Pro Val Cys His Val Asp His Pro Glu

ctt tgt cgt aga aga cgc tgatgctcca ggaccctctg aaccacgacg t
Leu Cys Arg Arg Arg Arg

TABLE 129 

DNA Sequence (SEQ ID NO:393) and Protein Sequence (SEQ ID NO:394) of S1.5

tct gat ggc agg aat gcc gcg gcc aac gac aaa gcg tct gac ctg gtc
Ser Asp Gly Arg Asn Ala Ala Ala Asn Asp Lys Ala Ser Asp Leu Val

gct ccg gcc atc agg gga tgc tgt tcc cac cct gtc tgt aac ttg agt
Ala Pro Ala Ile Arg Gly Cys Cys Ser His Pro Val Cys Asn Leu Ser

aat cca caa att tgt cgt gga aga cgc tgatgctcca ggaccctctg
Asn Pro Gln Ile Cys Arg Gly Arg Arg

aaccacgacg t



TABLE 130 

DNA Sequence (SEQ ID NO:395) and Protein Sequence (SEQ ID NO:396) of Tx1.5

ttt cat ggc agg aat gcc gca gcc aaa gcg tct ggc ctg gtc ggt ctg
Phe His Gly Arg Asn Ala Ala Ala Lys Ala Ser Gly Leu Val Gly Leu

acc gac aag agg caa gaa tgc tgt tct cat cct gcc tgt aac gta gat
Thr Asp Lys Arg Gln Glu Cys Cys Ser His Pro Ala Cys Asn Val Asp

cat cca gaa att tgt cgt tga
His Pro Glu Ile Cys Arg

TABLE 131

DNA Sequence (SEQ ID NO:397) and Protein Sequence (SEQ ID NO:398) of T1.1

act gat ggc agg agt gct gca gcc ata gcg ttt gcc ctg atc gct ccg
Thr Asp Gly Arg Ser Ala Ala Ala Ile Ala Phe Ala Leu Ile Ala Pro

acc gtc tgg gaa gga tgc tgt tct aat cct gcc tgt ctc gtg aat cat
Thr Val Trp Glu Gly Cys Cys Ser Asn Pro Ala Cys Leu Val Asn His

ata cgc ttt tgt ggt gga aga cgc tgatgcccca ggaccctctg aaccacgacg
Ile Arg Phe Cys Gly Gly Arg Arg



TABLE 132

DNA Sequence (SEQ ID NO:399) and Protein Sequence (SEQ ID NO:400) of Vr1.3

tct aat ggc atg aat gcc gca gcc atc agg aaa gcg tct gcc ctg gtg
Ser Asn Gly Met Asn Ala Ala Ala Ile Arg Lys Ala Ser Ala Leu Val

gct cag atc gcc cat cga gac tgc tgt gac gat cct gcc tgc acc gtg
Ala Gln Ile Ala His Arg Asp Cys Cys Asp Asp Pro Ala Cys Thr Val

aat aat cca ggc ctt tgc act tgaagatgct gctgccccag gaccctctga
Asn Asn Pro Gly Leu Cys Thr

accacgacgt 

TABLE 133 

DNA Sequence (SEQ ID NO:401) and Protein Sequence (SEQ ID NO:402) of G1.2

tct gat ggc ggg aat gcc gca gca aaa gag tct gac gtg atc gct ctg
Ser Asp Gly Gly Asn Ala Ala Ala Lys Glu Ser Asp Val Ile Ala Leu

acc gtc tgg aaa tgc tgt acc att cct tcc tgt tat gag aaa aaa aaa
Thr Val Trp Lys Cys Cys Thr Ile Pro Ser Cys Tyr Glu Lys Lys Lys

att aaa gca tgt gtc ttt tgacgacgct gatgctccag gaccctctga
Ile Lys Ala Cys Val Phe

accacgacgt 



TABLE 134 

DNA Sequence (SEQ ID NO:403) and Protein Sequence (SEQ ID NO:404) of Rg1.12

tct gat ggc gca gtc gac gac aaa gcg ttg gat cga atc gct gaa atc
Ser Asp Gly Ala Val Asp Asp Lys Ala Leu Asp Arg Ile Ala Glu Ile

gtc agg aga gga tgc tgt ggc aat cct gcc tgt agc ggc tcc tcg aaa
Val Arg Arg Gly Cys Cys Gly Asn Pro Ala Cys Ser Gly Ser Ser Lys

gat gca ccc tct tgt ggt tgaagacgct gctgctccag gaccctctga
Asp Ala Pro Ser Cys Gly

accacgacgt 

It will be appreciated that the methods and compositions of the instant invention can be
incorporated in the form of a variety of embodiments, only a few of which are disclosed herein. It 
will be apparent to the artisan that other embodiments exist and do not depart from the spirit of the 
invention. Thus, the described embodiments are illustrative and should not be construed as 
restrictive. 